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5 CROSS-REFERENCE TO RELATED APPLICATION 

This application is a continuation-in-part of U.S. 
Serial No. 08/544,404 filed 10 October 1995, which is a 
continuation-in-part of U.S. Serial No. 08/352,322 filed 7 
December 1994, which is a continuation-in-part of U.S. Serial 

10 No. 08/209,741 filed March 9, 1994, which is a continuation- 
in-part of U.S. Serial No. 08/165,699 filed December 10, 1993, 
which is a continuation-in-part os U.S. Serial No. 08/161,739 
filed December 3, 1993,| which is a continuation-in-part of 
U.S. Serial No. 08/155,301 filed November 18, 1993, which is a 

15 continuation-in-part of U.S. Serial No. 08/096,762 filed July 
22, 1993, which is a continuation-in-part of U.S. Serial No. 
08/053,131 filed April 26, 1993, which is a continuation-in- 
part of U.S. Serial No. 07/990,860 filed December 16, 1992, 
which is a continuation-in-part of U.S. Serial No. 07/904,068 

20 filed June 23, 1992, which is a continuation-in-part of U.S. 
Serial No. 07/853,408 filed March 18, 1992, which is a 
continuation-in-part of U.S. Serial No. 07/810,279 filed 
December 17, 1991, which is a continuation-in-part of U.S. 
Serial No. 07/575,962 filed August 31, 1990 (now abandoned), 

25 which is a continuation-in-part of U.S. Serial No. 07/574,748 
filed August 29, 1990 (now abandoned) . This application 
claims foreign priority benefits under Title 35, United States 
Code, Section 119, to PCT Application No. PCT/US91/06185 
(which corresponds to U.S. Serial No. 07/834,53 9 filed 

30 February 5, 1992) and PCT Application No. PCT/US92/10983 . 

TECHNICAL FIELD 

The invention relates to transgenic non-human 
animals capable of producing heterologous antibodies, 
3 5 transgenes used to produce such transgenic animals, 

transgenes capable of functionally rearranging a heterologous 
D gene in V-D-J recombination, immortalized B-cells capable of 
producing heterologous antibodies, methods and transgenes for 
producing heterologous antibodies of multiple isotypes, 



methods and transgenes for producing heterologous antibodies 
wherein a variable region sequence comprises somatic mutation 
as compared to germline rearranged variable region sequences, 
transgenic nonhuman animals which produce antibodies having a 
human primary sequence and which bind to human antigens, 
hybridomas made from B cells of such transgenic animals, and 
monclonal antibodies expressed by such hybridomas. 

BACKGROUND OF THE INVENTION 

One of the major impediments facing the development 
of in vivo therapeutic and diagnostic applications for 
monoclonal antibodies in humans is the intrinsic 
immunogenicity of non-human immunoglobulins . For example, when 
immunocompetent human patients are administered therapeutic 
doses of rodent monoclonal antibodies, the patients produce 
antibodies against the rodent immunoglobulin sequences; these 
human anti-mouse antibodies (HAMA) neutralize the therapeutic 
antibodies and can cause acute toxicity. Hence, it is 
desirable to produce human immunoglobulins that are reactive 
with specific human antigens that are promising therapeutic 
and/or diagnostic targets. However, producing human 
immunoglobulins that bind specifically with human antigens is 
problematic . 

The present technology for generating monoclonal 
antibodies involves pre-exposing, or priming, an animal 
(usually a rat or mouse) with antigen, harvesting B-cells from 
that animal, and generating a library of hybridoma clones. By 
screening a hybridoma population for antigen binding 
specificity (idiotype) and also screening for immunoglobulin 
class (isotype), it is possible to select hybridoma clones 
that secrete the desired antibody. 

However, when present methods for generating 
monoclonal antibodies are applied for the purpose of 
generating human antibodies that have binding specificities 
for human antigens, obtaining B-lymphocytes which produce 
human immunoglobulins a serious obstacle, since humans will 
typically not make immune responses against self -antigens . 



Hence, present methods of generating human 
monoclonal antibodies that are specifically reactive with 
human antigens are clearly insufficient. It is evident that 
the same limitations on generating monoclonal antibodies to 
5 authentic self antigens apply where non-human species are used 
as the source of B-cells for making the hybridoma. 

The construction of transgenic animals harboring a 
functional heterologous immunoglobulin transgene are a method 
by which antibodies reactive with self antigens may be 

10 produced. However, in order to obtain expression of 

therapeutically useful antibodies, or hybridoma clones 
producing such antibodies, the transgenic animal must produce 
transgenic B cells that are capable of maturing through the B 
lymphocyte development pathway. Such maturation requires the 

15 presence of surface IgM on the transgenic B cells, however 
isotypes other than IgM are desired for therapeutic uses. 
Thus, there is a need for transgenes and animals harboring 
such transgenes that are able to undergo functional V-D-J 
rearrangement to generate recombinational diversity and 

20 junctional diversity. Further, such transgenes and transgenic 
animals preferably include cis-acting sequences that 
facilitate isotype switching from a first isotype that is 
required for B cell maturation to a subsequent isotype that 
has superior therapeutic utility. 

2 5 A number of experiments have reported the use of 

transfected cell lines to determine the specific DNA sequences 
required for Ig gene rearrangement (reviewed by Lewis and 
Gellert (1989) , Cell , 59, 585-588) . Such reports have 
identified putative sequences and concluded that the 

3 0 accessibility of these sequences to the recombinase enzymes 

used for rearrangement is modulated by transcription 
(Yancopoulos and Alt (1985), Cell . 40, 271-281). The 
sequences for V(D)J joining are reportedly a highly conserved, 
near-palindromic heptamer and a less well conserved AT-rich 
3 5 nanomer separated by a spacer of either 12 or 23 bp (Tonegawa 
(1983), Nature, 302, 575-581; Hesse, et al. (1989), Genes in 
Dev. , 3., 1053-1061) . Efficient recombination reportedly 



occurs only between sites containing recombination signal 
sequences with different length spacer regions. 

Ig gene rearrangement, though studied in tissue 
culture cells, has not been extensively examined in transgenic 
mice* Only a handful of reports have been published 
describing rearrangement test constructs introduced into mice 
[Buchini, et al. (1987), Nature , 326 , 409-411 (unrearranged 
chicken X transgene) ; Goodhart, et al. (1987) , Froc. Natl. 
Acad. Sci. USA , 84/ 4229-4233) (unrearranged rabbit k gene) ; 
and Bruggemann, et al. (1989), Proc. Natl. Acad . Sci. USA, 86, 
67 09-6713 (hybrid mouse-human heavy chain) ] . The results of 
such experiments, however, have been variable, in some cases, 
producing incomplete or minimal rearrangement of the 
transgene . 

Further, a variety of biological functions of 
antibody molecules are exerted by the Fc portion of molecules, 
such as the interaction with mast cells or basophils through 
Fee, and binding of complement by Fcjtx or Fcy, it further is 
desirable to generate a functional diversity of antibodies of 
a given specificity by variation of isotype. 

Although transgenic animals have been generated that 
incorporate transgenes encoding one or more chains of a 
heterologous antibody, there have been no reports of 
heterologous transgenes that undergo successful isotype 
switching. Transgenic animals that cannot switch isotypes are 
limited to producing heterologous antibodies of a single 
isotype, and more specifically are limited to producing an 
isotype that is essential for B cell maturation, such as IgM 
and possibly IgD, which may be of limited therapeutic utility. 
Thus, there is a need for heterologous immunoglobulin 
transgenes and transgenic animals that are capable of 
switching from an isotype needed for B cell development to an 
isotype that has a desired characteristic for therapeutic use. 

Based on the foregoing, it is clear that a need 
exists for methods of efficiently producing heterologous 
antibodies, e.g. antibodies encoded by genetic sequences of a 
first species that are produced in a second species. More 
particularly, there is a need in the art for heterologous 



immunoglobulin transgenes and transgenic animals that are 
capable of undergoing functional V-D-J gene rearrangement that 
incorporates all or a portion of a D gene segment which 
contributes to recombinational diversity. Further, there is a 
need in the art for transgenes and transgenic animals that can 
support V-D-J recombination and isotype switching so that (1) 
functional B cell development may occur, and (2) 
therapeutically useful heterologous antibodies may be 
produced. There is also a need for a source of B cells which 
can be used to make hybridomas that produce monoclonal 
antibodies for therapeutic or diagnostic use in the particular 
species for which they are designed. A heterologous 
immunoglobulin transgene capable of functional V-D-J 
recombination and/or capable of isotype switching could 
fulfill these needs. 

In accordance with the foregoing object transgenic 
nonhuman animals are provided which are capable of producing a 
heterologous antibody, such as a human antibody. 

Further, it is an object to provide B-cells from 
such transgenic animals which are capable of expressing 
heterologous antibodies wherein such B-cells are immortalized 
to provide a source of a monoclonal antibody specific for a 
particular antigen. 

In accordance with this foregoing object, it is a 
further object of the invention to provide hybridoma cells 
that are capable of producing such heterologous monoclonal 
antibodies . 

Still further, it is an object herein to provide 
heterologous unrearranged and rearranged immunoglobulin heavy 
and light chain transgenes useful for producing the 
aforementioned non-human transgenic animals. 

Still further, it is an object herein to provide 
methods to disrupt endogenous immunoglobulin loci in the 
transgenic animals . 

Still further, it is an object herein to provide 
methods to induce heterologous antibody production in the 
aforementioned transgenic non-human animal. 



A further object of the invention is to provide 
methods to generate an immunoglobulin variable region gene 
segment repertoire that is used to construct one or more 
transgenes of the invention. 

The references discussed herein are provided solely 
for their disclosure prior to the filing date of the present 
application. Nothing herein is to be construed as an 
admission that the inventors are not entitled to antedate such 
disclosure by virtue of prior invention. 

SUMMARY OF THE INVENTION 
Transgenic nonhuman animals are provided which are 
capable of producing a heterologous antibody, such as a human 
antibody. Such heterologous antibodies may be of various 
isotypes, including: IgGl, lgG2 , IgG3 , IgG4, IgM, IgAl, IgA2 , 
IgA sec/ IgD, of IgE. In order for such transgenic nonhuman 
animals to make an immune response, it is necessary for the 
transgenic B cells and pre-B cells to produce surface-bound 
immunoglobulin, particularly of the IgM (or possibly IgD) 
isotype, in order to effectuate B cell development and 
antigen-stimulated maturation. Such expression of an IgM (or 
IgD) surface-bound immunoglobulin is only required during the 
antigen-stimulated maturation phase of B cell development, and 
mature B cells may produce other isotypes, although only a 
single switched isotype may be produced at a time. 

Typically, a cell of the B-cell lineage will produce 
only a single isotype at a time, although cis or trans 
alternative RNA splicing, such as occurs naturally with the /z s 
(secreted jz) and ^ M (membrane-bound m) forms, and the & and 8 
immunoglobulin chains, may lead to the contemporaneous 
expression of multiple isotypes by a single cell. Therefore, 
in order to produce heterologous antibodies of multiple 
isotypes, specifically the therapeutically useful IgG, IgA, 
and IgE isotypes, it is necessary that isotype switching 
occur. Such isotype switching may be classical class- 
switching or may result from one or more non-classical isotype 
switching mechanisms. 



The invention provides heterologous immunoglobulin 
transgenes and transgenic nonhuman animals harboring such 
transgenes, wherein the transgenic animal is capable of 
producing heterologous antibodies of multiple isotypes by 
undergoing isotype switching. Classical isotype switching 
occurs by recombination events which involve at least one 
switch seguence region in the transgene. Non-classical 
isotype switching may occur by, for example, homologous 
recombination between human and human E M sequences (8- 
associated deletion) . Alternative non-classical switching 
mechanisms, such as intertransgene and/or interchromosomal 
recombination, among others, may occur and effectuate isotype 
switching* Such transgenes and transgenic nonhuman animals 
produce a first immunoglobulin isotype that is necessary for 
antigen-stimulated B cell maturation and can switch to encode 
and produce one or more subsequent heterologous isotypes that 
have therapeutic and/or diagnostic utility. Transgenic 
nonhuman animals of the invention are thus able to produce, in 
one embodiment, IgG, IgA, and/ or IgE antibodies that are 
encoded by human immunoglobulin genetic sequences and which 
also bind specific human antigens with high affinity. 

The invention also encompasses B-cells from such 
transgenic animals that are capable of expressing heterologous 
antibodies of various isotypes, wherein such B-cells are 
immortalized to provide a source of a monoclonal antibody 
specific for a particular antigen. Hybridoma cells that are 
derived from such B-cells can serve as one source of such 
heterologous monoclonal antibodies. 

The invention provides heterologous unrearranged and 
rearranged immunoglobulin heavy and light chain transgenes 
capable of undergoing isotype switching in vivo in the 
aforementioned non-human transgenic animals or in explanted 
lymphocytes of the B-cell lineage from such transgenic 
animals. Such isotype switching may occur spontaneously or be 
induced by treatment of the transgenic animal or explanted B- 
lineage lymphocytes with agents that promote isotype 
switching, such as T-cell-derived lymphokines (e.g., IL-4 and 
IFN y) . 
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Still further, the invention includes methods to 
induce heterologous antibody production in the aforementioned 
transgenic non-human animal, wherein such antibodies may be of 
various isotypes. These methods include producing an antigen- 
stimulated immune response in a transgenic nonhuman animal for 
the generation of heterologous antibodies, particularly 
heterologous antibodies of a switched isotype (i.e., IgG, IgA, 
and IgE) . 

This invention provides methods whereby the 
transgene contains sequences that effectuate isotype 
switching, so that the heterologous immunoglobulins produced * 
in the transgenic animal and monoclonal antibody clones 
derived from the B-cells of said animal may be of various 
isotypes . 

This invention further provides methods that 
facilitate isotype switching of the transgene, so that 
switching between particular isotypes may occur at much higher 
or lower frequencies or in different temporal orders than 
typically occurs in germline immunoglobulin loci. Switch 
regions may be grafted from various C H genes and ligated to 
other C H genes in a transgene construct; such grafted switch 
sequences will typically function independently of the 
associated C H gene so that switching in the transgene 
construct will typically be a function of the origin of the 
associated switch regions. Alternatively, or in combination 
with switch sequences, ^-associated deletion sequences may be 
linked to various C H genes to effect non-classical switching 
by deletion of sequences between two ^-associated deletion 
sequences. Thus, a transgene may be constructed so that a 
particular C H gene is linked to a different switch sequence 
and thereby is switched to more frequently than occurs when 
the naturally associated switch region is used. 

This invention also provides methods to determine 
whether isotype switching of transgene sequences has occurred 
in a transgenic animal containing an immunoglobulin transgene. 

The invention provides immunoglobulin transgene 
constructs and methods for producing immunoglobulin transgene 
constructs, some of which contain a subset of germline 



immunoglobulin loci sequences (which may include deletions) . 
The invention includes a specific method for facilitated 
cloning and construction of immunoglobulin transgenes, 
involving a vector that employs unique Xhol and Sail 
restriction sites flanked by two unique NotI sites. This 
method exploits the complementary termini of Xhol and Sail 
restrictions sites and is useful for creating large constructs 
by ordered concatemerization of restriction fragments in a 
vector . 

The transgenes of the invention include a heavy 
chain transgene comprising DNA encoding at least one variable 
gene segment, one diversity gene segment, one joining gene 
segment and one constant region gene segment. The 
immunoglobulin light chain transgene comprises DNA encoding at 
least one variable gene segment, one joining gene segment and 
one constant region gene segment. The gene segments encoding 
the light and heavy chain gene segments are heterologous to 
the transgenic non-human animal in that they are derived from, 
or correspond to, DNA encoding immunoglobulin heavy and light 
chain gene segments from a species not consisting of the 
transgenic non-human animal. In one aspect of the invention, 
the transgene is constructed such that the individual gene 
segments are unrearranged, i.e., not rearranged so as to 
encode a functional immunoglobulin light or heavy chain. Such 
unrearranged transgenes permit recombination of the gene 
segments (functional rearrangement) and expression of the 
resultant rearranged immunoglobulin heavy and/ or light chains 
within the transgenic non-human animal when said animal is 
exposed to antigen. 

In one aspect of the invention, heterologous heavy 
and light immunoglobulin transgenes comprise relatively large 
fragments of unrearranged heterologous DNA. Such fragments 
typically comprise a substantial portion of the C, J (and in 
the case of heavy chain, D) segments from a heterologous 
immunoglobulin locus. In addition, such fragments also 
comprise a substantial portion of the variable gene segments. 

In one embodiment, such transgene constructs 
comprise regulatory sequences, e.g. promoters, enhancers, 
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class switch regions, recombination signals and the like, 
corresponding to sequences derived from the heterologous DNA. 
Alternatively, such regulatory sequences may be incorporated 
into the transgene from the same or a related species of the 
non-human animal used in the invention. For example, human 
immunoglobulin gene segments may be combined in a transgene 
with a rodent immunoglobulin enhancer sequence for use in a 
transgenic mouse. 

In a method of the invention, a transgenic non-human 
animal containing germline unrearranged light and heavy 
immunoglobulin transgenes - that undergo VDJ joining during 
D-cell differentiation - is contacted with an antigen to 
induce production of a heterologous antibody in a secondary 
repertoire B-cell . 

Also included in the invention are vectors and 
methods to disrupt the endogenous immunoglobulin loci in the 
non-human animal to be used in the invention. Such vectors 
and methods utilize a transgene, preferably positive-negative 
selection vector, which is constructed such that it targets 
the functional disruption of a class of gene segments encoding 
a heavy and/or light immunoglobulin chain endogenous to the 
non-human animal used in the invention. Such endogenous gene 
segments include diversity, joining and constant region gene 
segments. In this aspect of the invention, the 
positive-negative selection vector is contacted with at least 
one embryonic stem cell of a non-human animal after which 
cells are selected wherein the positive-negative selection 
vector has integrated into the genome of the non-human animal 
by way of homologous recombination. After transplantation, 
the resultant transgenic non-human animal is substantially 
incapable of mounting an immunoglobulin-mediated immune 
response as a result of homologous integration of the vector 
into chromosomal DNA. Such immune deficient non-human animals 
may thereafter be used for study of immune deficiencies or 
used as the recipient of heterologous immunoglobulin heavy and 
light chain transgenes. 

The invention also provides vectors, methods, and 
compositions useful for suppressing the expression of one or 
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more species of immunoglobulin chain (s) , without disrupting an 
endogenous immunoglobulin locus. Such methods are useful for 
suppressing expression of one or more endogenous 
immunoglobulin chains while permitting the expression of one 
5 or more transgene-encoded immunoglobulin chains. Unlike 
genetic disruption of an endogenous immunoglobulin chain 
locus, suppression of immunoglobulin chain expression does not 
require the time-consuming breeding that is needed to 
establish transgenic animals homozygous for a disrupted 

10 endogenous Ig locus. An additional advantage of suppression 
as compared to engognous Ig gene disruption is that, in 
certain embodiments, chain suppression is reversible within an 
individual animal. For example, Ig chain suppression may be 
accomplished with: (1) transgenes encoding and expressing 

15 antisense RNA that specifically hybridizes to an endogenous Ig 
chain gene sequence, (2) antisense oligonucleotides that 
specifically hybridize to an endogenous Ig chain gene 
sequence, and (3) immunoglobulins that bind specifically to an 
endogenous Ig chain polypeptide. 

20 The invention provides transgenic non-human animals 

comprising: a homozygous pair of functionally disrupted 
endogenous heavy chain alleles, a homozygous pair of 
functionally disrupted endogenous light chain alleles, at 
least one copy of a heterologous immunoglobulin heavy chain 

25 transgene, and at least one copy of a heterologous 

immunoglobulin heavy chain transgene, wherein said animal 
makes an antibody response following immunization with an 
antigen, such as a human antigen (e.g., CD4) . The invention 
also provides such a transgenic non-human animal wherein said 

3 0 functionally disrupted endogenous heavy chain allele is a J H 
region homologous recombination knockout, said functionally 
disrupted endogenous light chain allele is a J K region 
homologous recombination knockout, said heterologous 
immunoglobulin heavy chain transgene is the HC1 or HC2 human 

3 5 minigene transgene, said heterologous light chain transgene 

is the KC2 or KCle human k transgene, and wherein said antigen 
is a human antigen. 
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The invention also provides various embodiments for 
suppressing, ablating, and/or functionally disrupting the 
endogenous nonhuman immunoglobulin loci. 

The invention also provides transgenic mice 
expressing both human sequence heavy chains and chimeric heavy 
chains comprising a human sequence heavy chain variable region 
and a murine sequence heavy chain constant region. Such 
chimeric heavy chains are generally produced by trans- 
switching between a functionally rearranged human transgene 
and an endogenous murine heavy chain constant region (e.g., 
7I, 72a, 72b, 73) . Antibodies comprising such chimeric heavy 
chains, typically in combination with a transgene-encoded 
human sequence light chain or endogenous murine light chain, 
are formed in response to immunization with a predetermined 
antigen. The transgenic mice of these embodiments can 
comprise B cells which produce (express) a human sequence 
heavy chain at a first timepoint and trans-switch to produce 
(express) a chimeric heavy chain composed of a human variable 
region and a murine constant region (e.g., 7I, 72a, 72b, 73) 
at a second (subsequent) timepoint; such human sequence and 
chimeric heavy chains are incorporated into functional 
antibodies with light chains; such antibodies are present in 
the serum of such transgenic mice. Thus, to restate: the 
transgenic mice of these embodiments can comprise B cells 
which express a human sequence heavy chain and subsequently 
switch (via trans-switching or cis-switching) to express a 
chimeric or isotype-switched heavy chain composed of a human 
variable region and a alternative constant region (e.g., 
murine 71, 72a, 72b, 73; human 7, a, €) ; such human sequence 
and chimeric or isotype-switched heavy chains are incorporated 
into functional antibodies with light chains (human or mouse) ; 
such antibodies are present in the serum of such transgenic 
mice. 

The invention also provides a method for generating 
a large transgene, said method comprising: 

introducing into a mammalian cell at least three 
polynucleotide species; a first polynucleotide species having 
a recombinogenic region of sequence identity shared with a 



second polynucleotide species, a second polynucleotide species 
having a recombinogenic region of sequence identity shared 
with a first polynucleotide species and a recombinogenic 
region of sequence identity shared with a third polynucleotide 
5 species, and a third polynucleotide species having a 

recombinogenic region of sequence identity shared with said 
second polynucleotide species. 

Recombinogenic regions are regions of substantial 
sequence identity sufficient to generate homologous 

10 recombination in vivo in a mammalian cell (e.g., ES cell), and 
preferably also in non-mammalian eukaryotic cells (e.g., 
Saccharaomyces and other yeast or fungal cells) . Typically, 
recombinogenic regions are at least 50 to 100000 nucleotides 
long or longer, preferably 500 nucleotides to 10000 

15 nucleotides long, and are often 80-100 percent identical, 
frequently 95-100 percent identical, often isogenic. 

The references discussed herein are provided solely 
for their disclosure prior to the filing date of the present 
application. Nothing herein is to be construed as an 

2 0 admission that the inventors are not entitled to antedate such 

disclosure by virtue of prior invention. 

BRIEF DESCRIPTION OF THE FIGURES 
Fig. 1 depicts the complementarity determining 
25 regions CDR1, CDR2 and CDR3 and framework regions FR1, FR2, 
FR3 and FR4 in unrearranged genomic DNA and mRNA expressed 
from a rearranged immunoglobulin heavy chain gene, 
Fig. 2 depicts the human X chain locus, 
Fig. 3 depicts the human k chain locus, 

3 0 Fig. 4 depicts the human heavy chain locus, 

Fig. 5 depicts a transgene construct containing a 
rearranged IgM gene ligated to a 25 kb fragment that contains 
human 73 and 7I constant regions followed by a 700 bp fragment 
containing the rat chain 3 1 enhancer sequence. 
3 5 Fig. 6 is a restriction map of the human k chain 

locus depicting the fragments to be used to form a light chain 
transgene by way of in vivo homologous recombination. 

Fig. 7 depicts the construction of pGPl. 
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Fig. 8 depicts the construction of the polylinker 
contained in pGPl. 

Fig. 9 depicts the fragments used to construct a 
human heavy chain transgene of the invention. 
5 Fig. 10 depicts the construction of pHIGl and pCONl. 

Fig. 11 depicts the human C7I fragments which are 
inserted into pRE3 (rat enhancer 3') to form pREG2 . 

Fig. 12 depicts the construction of pHIG3 1 and PCON. 

Fig. 13 depicts the fragment containing human D 
10 region segments used in construction of the transgenes of the 
invention. 

Fig. 14 depicts the construction of pHIG2 (D segment 
containing plasmid) . 

Fig. 15 depicts the fragments covering the human Jk 
15 and human Ck gene segments used in constructing a transgene of 
the invention . 

Fig. 16 depicts the structure of pE/i. 

Fig. 17 depicts the construction of pKapH. 

Figs. 18A through 18D depict the construction of a 

2 0 positive-negative selection vector for functionally disrupting 

the endogenous heavy chain immunoglobulin locus of mouse. 

Figs. 19A through 19C depict the construction of a 
positive-negative selection vector for functionally disrupting 
the endogenous immunoglobulin light chain loci in mouse. 
25 Figs. 2 OA through 2 0E depict the structure of a 

kappa light chain targeting vector. 

Figs. 21A through 2 IF depict the structure of a 
mouse heavy chain targeting vector. 

Fig. 22 depicts the map of vector pGPe. 

3 0 Fig. 2 3 depicts the structure of vector p JM2 . 

Fig. 24 depicts the structure of vector pCORl. 
Fig. 2 5 depicts the transgene constructs for pIGMl, 
pHCl and pHC2 . 

Fig. 26 depicts the structure of P7e2 . 
35 Fig. 27 depicts the structure of pVGEl. 

Fig. 2 8 depicts the assay results of human Ig 
expression in a pHCl transgenic mouse. 

Fig. 29 depicts the structure of pJCKl. 



Fig. 3 0 depicts the construction of a synthetic 
heavy chain variable region* 

Fig. 31 is a schematic representation of the heavy 
chain minilocus constructs pIGM lf pHCl, and pHC2 . 

Fig. 3 2 is a schematic representation of the heavy 
chain minilocus construct pIGGl and the k light chain 
minilocus construct pKCl, pKVel, and pKC2 ♦ 

Fig. 3 3 depicts a scheme to reconstruct functionally 
rearranged light chain genes. 

Fig. 3 4 depicts serum ELISA results 

Fig. 3 5 depicts the results of an ELISA assay of 
serum from 8 transgenic mice. 

Fig. 3 6 is a schematic representation ,of plasmid 

pBCEl. 

Figs. 37A-37C depict the immune response of 
transgenic mice of the present invention against KLH-DNP, by 
measuring IgG and IgM levels specific for KLH-DNP (37A) , KLH 
(37B) and BSA-DNP (37C) . 

Fig. 3 8 shows ELISA data demonstrating the presence 
of antibodies that bind human carcinoembryonic antigen (CEA) 
and comprise human /x chains; each panel shows reciprocal 
serial dilutions from pooled serum samples obtained from mice 
on the indicated day following immunization. 

Fig. 3 9 shows ELISA data demonstrating the presence 
of antibodies that bind human carcinoembryonic antigen (CEA) 
and comprise human y chains; each panel shows reciprocal 
serial dilutions from pooled serum samples obtained from mice 
on the indicated day following immunization. 

Fig. 40 shows aligned variable region sequences of 
2 3 randomly-chosen cDNAs generated from xaRNA obtained from 
lymphoid tissue of HC1 transgenic mice immunized with human 
carcinoembryonic antigen (CEA) as compared to the germline 
transgene sequence (top line) ; on each line nucleotide changes 
relative to germline sequence are shown. The regions 
corresponding to heavy chain CDR1 , CDR2 , and CDR3 are 
indicated. Non-germline encoded nucleotides are shown in 
capital letters . 
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Fig. 4 1 show the nucleotide sequence of a. human DNA 
fragment r designated vk65 - 3 , containing a. V K gene segment ; the 
deduced amino ac id seguences of the V K coding regions are also 
shown ; sol icing and recombination signal seguences 
5 ( hep tamer / nonamer ) are shown boxed. 

Fig;. 42 show the nucleotide seguence of a human DNA 
fragment, designated vk.65.5, conta ining a gene segment ; the 

deduced amino acid seguences of the V K coding regions are also 
shown; splicing and recombination signal sequences 
1 O ( heptamer / nonamer ) are shown boxed - 

Fig. 4 3 show the nuc leotide seguence of a human DNA 
fragment, designated vk65.8, containing a gene segment ; the 

deduced amino acid seguences of the V K coding regions are also 
shown ; splicing and recombination signal seguences 
15 ( hep tamer / nonamer ) are shown boxed. 

Fig. 44 show the nucleotide seguence of a human DNA 
fragment, designated vk6 5 . 15, containing a V K gene segment; 
the deduced amino acid sequences of the V K coding regions are 
also shown ; splicing and recombination signal seguences 
20 ( heptamer / nonamer ) are shown boxed - 

Fig. 4 5 shows formation of a light chain mini locus 
by homologous recombination between two overlapping fragments 
which were co-inj ected . 

Fig. 46 shows ELISA results for monoclonal 

2 5 antibodies reactive with CEA and non — CEA antigens showing the 

specificity of antigen binding. 

Fig. 4*7 shows the DNA seguences of 1 0 cDNAs 
amplified by PCR to ampl if y transcripts having a human VDJ and 
a murine constant region sequence - 

3 O Fig. 4 8 shows ELI SA results for various dilutions of 

serum obtained from mice bearing both a human heavy chain 
mini locus transgene and a human k mini locus transgene ; the 
mouse was immunized with human CD4 and the data shown 
represents antibodies reactive with human CD4 and possessing 
35 human k , human ju , or human ~y epitopes, respectively. 

Fig. 4 9 shows relative distribution of lymphocytes 
staining for human fj, or mouse jj. as determined by FACS for 
three mouse genotypes . 
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Fig- 5 O shows relative distribution of lymphocytes 
staining for human k or mouse k as determined by FACS for 
three mouse genotypes - 

Fig. 51 shows relative distribution of lymphocytes 
5 sta in ing for mouse X as determined by FACS for three mouse 
genotypes . 

Fig- 52 shows relative distribution of lymphocytes 
staining for mouse X or human k as determined by FACS for four 
mouse genotypes . 

1 0 Fig- 53 shows the amounts of human jj, , human *y , human 

k, mouse /x, mouse -y , mouse k , and mouse X chains in the serum 

of un immunized O 0 11 mice. 

Fig. 54 shows a scatter plot showing the amounts of 

human jlx , human -y , human k , mouse fjL , mouse -y , mouse k , and 
15 mouse X cha ins in the serum of unimmunized O 0 1 1 mice of 

various genotypes . 

Fig. 55 shows the titres of antibodies comprising 

human fj. , human -y , or human k chains in anti — CD4 antibodies in 

the serum taken at three weeks or seven weeks pos t— 
2 O immunization following immun i z at ion of a OOll mouse with human 

CD4 . 

Fig- 5 6 shows a schema t ic representation of the 
human heavy cha in mini locus transgenes pHCl and pHC2 , and the 
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Structural genes are shown by closed boxes in the top line; 
second and third lines show restriction sites with symbols 
indicated. 

Fig. 61 shows a nucleotide sequence of mouse heavy 
chain locus a constant region gene. 

Fig. 62 shows the construction of a frameshift 
vector (plasmid B) for introducing a two bp frameshift into 
the murine heavy chain locus J 4 gene. 

Fig. 63 shows isotype specific response of 
transgenic animals during hyper immunization. The relative 
levels of reactive human \i and 71 are indicated by a 
colorimetric ELISA assay (y-axis) . We immunized three 7-10 
week old male HC1 line 57 transgenic animals (#1991, #2356, 
#2357) , in a homozygous JHD background, by intraperitoneal 
injections of CEA in Freund's adjuvant. The figure depicts 
binding of 250 fold dilutions of pooled serum (collected prior 
to each injection) to CEA coated microtiter wells. 

Fig. 64A and 64B show expression of transgene 
encoded 7I isotype mediated by class switch recombination. 
The genomic structure of integrated transgenes in two 
different human 7I expressing hybridomas is consistent with 
recombination between the \i and 71 switch regions. Fig. 64A 
shows a Southern blot of Pacl/Sfil digested DNA isolated from 
three transgene expressing hybridomas. From left to right: 
clone 92-09A-5H1-5, human 7l + /M~; clone 92-90A-4G2-2 , human 
7l + /M"; clone 92-09A-4F7-A5-2 , human 71", M + - All three 
hybridomas are derived from a 7 month old male mouse 
hemizygous for the HC1-57 integration, and homozygous for the 
JHD disruption (mouse #1991) . The blot is hybridized with a 
probe derived from a 2 . 3 kb Bglll/Sfil DNA fragment spanning 
the 3' half of the human 7I switch region. No switch product 
is found in the \i expressing hybridoma, while the two 71 
expressing hybridomas, 92-09A-5H1-5 and 92-09A-4G2-2 , contain 
switch products resulting in Pacl/Sfil fragments of 5.1 and 
5.3 kb respectively, Fig. 64B is a diagram of two possible 
deletional mechanisms by which a class switch from /x to 7I can 
occur. The human (i gene is flanked by 4 00 bp direct repeats 
(a/x and E/x) which can recombine to delete m- Class switching 
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by this mechanism will always generate a 6.4 kb Pacl/Sfil 
fragment, while class switching by recombination between the n 
and the 7I switch regions will generate a Pacl/Sfil fragment 
between 4 and 7 kb, with size variation between individual 
switch events. The two yl expressing hybridomas examined in 
Fig. 64A appear to have undergone recombination between the [i 
and 7I switch regions. 

Fig. 65 shows chimeric human/mouse immunoglobulin 
heavy chains generated by trans-switching. cDNA clones of 
trans-switch products were generated by reverse transcription 
and PCR amplification of a mixture of spleen and lymph node 
RNA isolated from a hyper immunized HC1 transgenic-JHD mouse 
(#2357; see legend to Fig. 63 for description of animal and 
immunization schedule) . The partial nucleotide sequence of 10 
randomly picked clones is shown. Lower case letters indicate 
germline encoded, capital letters indicate nucleotides that 
cannot be assigned to known germline sequences; these may be 
somatic mutations, N nucleotides, or truncated D segments. 
Both face type indicates mouse 7 sequences. 

Figs. 66A and 66B show that the rearranged V.H251 
transgene undergoes somatic mutation in a hyper immunized. The 
partial nucleotide sequence of IgG heavy chain variable region 
cDNA clones from CHI line 26 mice exhibiting Fig. 66A primary 
and Fig. 66B secondary responses to antigen. Germline 
sequence is shown at the top; nucleotide changes from germline 
are given for each clone. A period indicates identity with 
germline sequence, capital letters indicate no identified 
germline origin. The sequences are grouped according to J 
segment usage. The germline sequence of each of the J 
segments if shown. Lower case letters within CDR3 sequences 
indicate identity to known D segment included in the HC1 
transgene. The assigned D segments are indicated at the end 
of each sequence. Unassigned sequences could be derived from 
N region addition or somatic mutation; or in some cases they 
are simply too short to distinguish random N nucleotides from 
known D segments. Fig. 66A primary response: 13 randomly 
picked VH251-7I cDNA clones. A 4 week old female HC1 line 
2 6-JHD mouse (#2 599) was given a single injection of KLH and 
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complete Freunds adjuvant; spleen cell RNA was isolated 5 days 
later. The overall frequency of somatic mutations within the 
V segment is 0,06% (2/3,198 bp). Fig. 66B secondary response: 
13 randomly picked VH251-7I cDNA clones. A 2 month old female 
5 HC1 line 26-JHD mouse (#3204) was given 3 injections of HEL 
and Freunds adjuvant over one month (a primary injection with 
complete adjuvant and boosts with incomplete at one week and 3 
weeks) ; spleen and lymph node RNA was isolated 4 months later. 
The overall frequency of somatic mutations within the V 

10 segment is 1.6% (52/3,198 bp). 

Figs. 67A and 67B show that extensive somatic 
mutation is confined to 71 sequences: somatic mutation and 
class switching occur within the same population of B cells. 
Partial nucleotide sequence of VH251 cDNA clones isolated from 

15 spleen and lymph node cells of HC1 line 57 transgenic-JHD 
mouse (#2357) hyper immunized against CEA (see Fig. 63 for 
immunization schedule) . Fig. 67A: IgM: 23 randomly picked 
VH251-jL6 cDNA clones. Nucleotide sequence of 156 bp segment 
including CDRs 1 and 2 surrounding residues. The overall 

20 level of somatic mutation is 0.1% (5/3,744 bp). Fig 67B: IgG: 
23 randomly picked VH251-7I cDNA clones * Nucleotide sequence 
of segment including CDRs 1 through 3 and surrounding 
residues. The overall frequency of somatic mutation within 
the V segment is 1.1% (65/5,658 bp). For comparison with the 

25 jit sequences in Fig. 67A: the mutation frequency for first 
156 nucleotides is 1.1% (41/3,588 bp). See legend to 
Figs. 66A and 66B for explanation of symbols. 

Fig. 68 indicates that VH51P1 and VH56P1 show 
extensive somatic mutation of in an unimmunized mouse. The 

3 0 partial nucleotide sequence of IgG heavy chain variable region 
cDNA clones from a 9 week old, unimmunized female HC2 line 
2550 transgenic-JHD mouse (#5250) . The overall frequency of 
somatic mutation with the 19 VH56pl segments is 2.2% 
(101/4,674 bp). The overall frequency of somatic mutation 

35 within the single VHSlpl segment is 2.0% (5/246 bp). See 
legend to Figs. 66A and 66B for explanation of symbols. 

Fig. 69. Double transgenic mice with disrupted 
endogenous Ig loci contain human IgM* positive B cells. FACS 



of cells isolated from spleens of 4 mice with different 
genotypes. Left column: control mouse (#9944, 6 wk old 
female JH+ / - , JC/c+/-; heterozygous wild-type mouse heavy and 
K-light chain loci, non-transgenic) . Second column: human 
heavy chain transgenic (#9877, 6 wk old female JH-/-, JCk-/-, 
HC2 line 2550 +; homozygous for disrupted mouse heavy and k- 
light chain loci, hemizygous for HC2 transgene) . Third 
column: human K-light chain transgenic (#9878, 6 wk old 
female JH-/-, JCk-/-, KCo4 line 4437 +; homozygous for 
disrupted mouse heavy and <c-light chain loci, hemizygous for 
KCo4 transgene). Right column: double transgenic (#9879, 6 
wk old female JH-/-m JCk-/-, HC2 line 2550 +, KCo4 line 4437 
+ ; homozygous for disrupted mouse heavy and /ck-light chain 
loci, hemizygous for HC2 and KCo4 transgenes) . Top row: 
spleen cells stained for expression of mouse X light chain (x- 
axis) and human k light chain (y-axis) . Second row: spleen 
cells stained for expression of human n heavy chain (x-axis) 
and human k light chain (y-axis) . Third row: spleen cells 
stained for expression of mouse /x heavy chain (x-axis) and 
mouse k light chain (y-axis) . Bottom row: histogram of 
spleen cells stained for expression of mouse B220 antigen (log 
fluorescence: x-axis; cell number: y-axis). For each of the 
two color panels, the relative number of cells in each of the 
displayed guadrants is given as percent of a e-parameter gate 
based on propidium iodide staining and light scatter . The 
fraction of B22 0+ cells in each of the samples displayed in 
the bottom row is given as a percent of the lymphocyte light 

scatter gate. 

Fig. 70. Secreted immunoglobulin levels in the 
serum of double transgenic mice. Human \i , y, and k, and mouse 
y and X from 18 individual HC2/KCo4 double transgenic mice 
homozygous for endogenous heavy and K-light chain locus 
disruption. Mice: (+) HC2 line 2550 ("5 copies of HC2 per 
integration), KCo4 line 4436 (1-2 copies of KCo4 per 
integration); (O) HC2 line 2550, KCo4 line 4437 ("10 copies of 
KCo4 per integration); (x) HC2 line 2550, KCo4 line 4583 (~5 
copies of KCo4 per integration) ; (□) HC2 line 2572 (30-50 
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Fig. 50 shows relative distribution of lymphocytes 
staining for human k or mouse k as determined by FACS for 
three mouse genotypes. 

Fig, 51 shows relative distribution of lymphocytes 
5 staining for mouse X as determined by FACS for three mouse 
genotypes . 

Fig. 52 shows relative distribution of lymphocytes 
staining for mouse X or human k as determined by FACS for four 
mouse genotypes. 

10 Fig. 53 shows the amounts of human /z, human y, human 

k, mouse jjl, mouse y, mouse k, and mouse X chains in the serum 
of unimmunized 0011 mice. 

Fig. 54 shows a scatter plot showing the amounts of 
%j human jlx, human y, human k, mouse jjl, mouse y, mouse k, and 

15 mouse X chains in the serum of unimmunized 0011 mice of 
jj various genotypes . 

S". Fig. 55 shows the titres of antibodies comprising 

J" human \x, human y, or human k chains in anti-CD4 antibodies in 

M- the serum taken at three weeks or seven weeks post- 

20 immunization following immunization of a 0011 mouse with human 
p CD 4 . 

J : Fig. 56 shows a schematic representation of the 

?s human heavy chain minilocus transgenes pHCl and pHC2 , and the 

light chain minilocus transgenes pKCl, pKCle, and the light 
25 chain minilocus transgene created by homologous recombination 
between pKC2 and Co4 at the site indicated. 

Fig. 57 shows a linkage map of the murine lambda 
light chain locus as taken from Storb et al. (1989) op. cit . ; 
the stippled boxes represent a pseudogene. 
3 0 Fig. 58 shows a schematic representation of 

inactivation of the murine X locus by homologous gene 
targeting. 

Fig. 59 schematically shows the structure of a 
homologous recombination targeting transgene for deleting 
3 5 genes, such as heavy chain constant region genes. 

Fig. 60 shows a map of the BALB/c murine heavy chain 
locus as taken from Immunoglobulin Genes , Hon jo, T, Alt, FW, 
and Rabbits TH (eds.) Academic Press, NY (1989) p. 12 9. 
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copies of HC2 per integration, KCo4 line 4437; (a) HC2 line 
5467 (20-30 copies of HC2 per integration, KCo4 line 4437. 

Figs* 71A and 7 IB show human antibody responses to 
human antigens . Fig. 71A: Primary response to recombinant 
5 human soluble CD4 . Levels of human IgM and human k light 

chain are reported for prebleed (O) and post-immunization (•) 
serum from four double transgenic mice. Fig. 7 IB: Switching to 
human IgG occurs in vivo. Human IgG (circles) was detected 
with peroxidase conjugated polyclonal anti-human IgG used in 

10 the presence of 1.5 ju/ml excess IgE, k and 1% normal mouse 

serum to inhibit non-specific cross-reactivity. Human k light 
chain (squares) was detected using a peroxidase conjugated 
polyclonal anti-human k reagent in the presence of 1% normal 
mouse serum. A representative result from one mouse (#9344; 

15 HC2 line 2550, KCo4 line 443 6) is shown. Each point 

represents an average of duplicate wells minus background 
absorbance. 

Fig. 72 shows FACS analysis of human PBL with a 
hybridoma supernatant that discriminates human CD4+ 

2 0 lymphocytes from human CD8+ lymphocytes. 

Fig. 73 shows human a-CD4 IgM anf IgG in transgenic 
mouse serum. 

Fig. 74 shows competition binding experiments 
comparing a transgenic mouse a-human CD4 hybridoma monoclonal, 
25 2C11-8, to the RPA-TA and Leu-3A monoclonals. 

Fig. 75 shows production data for Ig expression of 
cultured 2C11-8 hybridoma. 

Fig. 7 6 shows an overlapping set of plasmid inserts 
constituting the HCo7 transgene. 

3 0 Fig. 77A depicts the nucleotide sequence and 

restriction map of pGP2b plasmid vector. 

Fig. 77B depicts the restriction map of pGP2b 
plasmid vector. 

Fig. 78 (parts A and B) depicts cloning strategy for 
3 5 assembling large transgenes. 

Fig. 79 shows that large inserts are unstable in 
high-copy pUC derived plasmids. 
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Fig. 80 shows phage PI clone Pl-570. Insert spans 
portion of human heavy chain constant region covering 73 and 
yl, together with switch elements. N, NotI; S, Sail, X, Xhol. 

Fig. 81 shows serum expression of human \x and yl in 
5 HCo7 transgenic founder animals. 

Fig. 82 shows serum expression of human 
immunoglobulins in HCo7/KCo4 double transgenic/ double deletion 
mice. 

Fig. 83 shows RT PCR detection of human 7I and 73 
10 transcripts in HCo7 transgenic mouse spleen RNA. 

Fig. 84 shows induction of human IgGl and IgG3 by 
LPS and IL-4 in vitro . 

Fig. 85. Agarose gel electrophoresis apparatus for 
concentration of YAC DNA. 
15 Fig. 86. Two color FACS analysis of bone marrow 

cells from HC2 /KCo5/ JHD/JKD and HC2 /KCo4 / JHD / JKD mice. The 
fraction of cells in each of the B220 + /CD43~, B220 + /CD43 + , and 
B22 0 + /IgM + gates is given as a percent. 

Fig. 87. Two color FACS analysis of spleen cells 
20 from HC2/KCo5 /JHD/JKD and HC2/KCo4 /JHD/JKD mice. The fraction 
of cells in each of the B220 bri ^ ht /IgM + and B220 dull /IgM + gates 
is given as a percent. 

Fig. 88. Binding Of IgGx anti-nCD4 monoclonal antibodies 
to CD4+ SupTl cells. 

25 Fig. 89 Epitope determination for IgG anti-nCD4 

monoclonal antibodies by flow cytometry. SupTl cells were 
pre-incubated with buffer (left column), 2.5 mg/mi rpa-t4 (middle 
column), or 2.5 mg/ml Leu3a (right column) and then with one of the 10 
human IgG monoclonal antibodies (in supernatant diluted 1:2), or chimeric 
30 Leu3a. Results for 3 representative human IgG monoclonal antibodies are 
shown in this figure. 

Fig. 90 Inhibition of an MLR by a human IgGk anti-CD4 
monoclonal antibody. 

Table 1 depicts the sequence of vector pGPe. 
3 5 Table 2 depicts the sequence of gene V H 49.8. 

Table 3 depicts the detection of human IgM and IgG in the serum 
of transgenic mice of this invention. 

Table 4 depicts sequences of VDJ joints. 

Table 5 depicts the distribution of J segments incorporated 
40 into pHCl transgene encoded transcripts to J segments found in adult human 
peripheral blood lymphocytes (PBL). 
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Table 6 depicts the distribution of D segments incorporated 
into pHCl transgene encoded transcripts to D segments found in adult human 
peripheral blood lymphocytes (PBL) . 

Table 7 depicts the length of the CDR3 peptides from 
5 transcripts with in-frame VDJ joints in the pHCl transgenic mouse and in 
human PBL, 

Table 8 depicts the predicted amino acid sequences of the VDJ 
regions from 30 clones analyzed from a pHCl transgenic. 

Table 9 shows transgenic mice of line 112 that were used in the 
10 indicated experiments; (+) indicates the presence of the respective 
transgene, (++) indicates that the animal is homozygous for the J H D 

knockout transgene. 

Table 10 shows the genotypes of several 0011 mice. 

Table 11 shows human variable region usage in hybridomas from 

15 transgenic mice. 

Table 12 shows transgene V and J segment usage. 
Table 13 shows the occurrence of somatic mutation in the HC2 
heavy chain transgene in transgenic mice. 

Table 14 shows identification of human V K segments on the 

20 YAC 4X17E1. 

Table 15. Identification of human Vk genes 

expressed in mouse line KCo5-9272. 

Table 16. Secretion levels for human IgGk Anti-nCD4 

monoclonal antibodies 
25 Table 17. Rate and affinity constants for 

monoclonal antibodies that bind to human CD4. 

Table 18. Affinity and rate constants of human 
anti-human CD 4 monoclonal antibodies. 

Table 19. Avidity and rate constants of human anti- 
3 0 human CD 4 monoclonal antibodies. 

Table 20. Avidity and rate constants reported for 
anti CD 4 monoclonal antibodies. 

Table 21. Avidity constants of human anti-human CD 4 
monoclonal antibodies as determined by flow cytometry. 
35 Table 22. Partial Nucleotide Sequence for 

Functional Transcripts. 

Table 23 Germline V(D)J Segment Usage in Hybridoma 

Transcripts . 

Table 24. Primers, Vectors and Products Used in 
40 Minigene Construction. 



Table 25. Effect of Human mAbs on Peripheral 
Chimpanzee Lymphocytes. 

DETAILED DESCRIPTION 
As has been discussed supra, it is desirable to 
produce human immunoglobulins that are reactive with specific 
human antigens that are promising therapeutic and/or 
diagnostic targets. However, producing human immunoglobulins 
that bind specifically with human antigens is problematic. 

First, the immunized animal that serves as the 
source of B cells must make an immune response against the 
presented antigen. In order for an animal to make an immune 
response, the antigen presented must be foreign and the animal 
must not be tolerant to the antigen. Thus, for example, if it 
is desired to produce a human monoclonal antibody with an 
idiotype that binds to a human protein, self -tolerance will 
prevent an immunized human from making a substantial immune 
response to the human protein, since the only epitopes of the 
antigen that may be immunogenic will be those that result from 
polymorphism of the protein within the human population 
(allogeneic epitopes) . 

Second, if the animal that serves as the source of 
B-cells for forming a hybridoma (a human in the illustrative 
given example) does make an immune response against an 
authentic self antigen, a severe autoimmune disease may result 
in the animal. Where humans would be used as a source of B- 
cells for a hybridoma, such autoimmunization would be 
considered unethical by contemporary standards. Thus, 
developing hybridomas secreting human immunoglobulin 
chainsspecif ically reactive with predetermined human antigens 
is problematic, since a reliable source of human antibody- 
secreting B cells that can evoke an antibody response against 
predetermined human antigens is needed. 

One methodology that can be used to obtain human 
antibodies that are specifically reactive with human antigens 
is the production of a transgenic mouse harboring the human 
immunoglobulin transgene constructs of this invention. 
Briefly, transgenes containing all or portions of the human 
immunoglobulin heavy and light chain loci, or transgenes 



containing synthetic "miniloci" (described infra, and in 
copending applications U.S. S.N. 08/352,322, filed 7 December 
1994, U.S. S.N. 07/990,860, filed 16 December 1992, U.S. S.N. 
07/810,279 filed 17 December 1991, U.S. S.N. 07/904,068 filed 
23 June 1992; U.S. S.N. 07/853,408, filed 18 March 1992, 
U.S. S.N. 07/574,748 filed August 29, 1990, U.S. S.N. 07/575,962 
filed August 31, 1990, and PCT/US91/06185 filed August 28, 
1991, each incorporated herein by reference) which comprise 
essential functional elements of the human heavy and light 
chain loci, are employed to produce a transgenic nonhuman 
animal. Such a transgenic nonhuman animal will have the 
capacity to produce immunoglobulin chains that are encoded by 
human immunoglobulin genes, and additionally will be capable 
of making an immune response against human antigens. Thus, 
such transgenic animals can serve as a source of immune sera 
reactive with specified human antigens, and B-cells from such 
transgenic animals can be fused with myeloma cells to produce 
hybridomas that secrete monoclonal antibodies that are encoded 
by human immunoglobulin genes and which are specifically 
reactive with human antigens. 

The production of transgenic mice containing various 
forms of immunoglobulin genes has been reported previously. 
Rearranged mouse immunoglobulin heavy or light chain genes 
have been used to produce transgenic mice. In addition, 
functionally rearranged human Ig genes including the m or yl 
constant region have been expressed in transgenic mice. 
However, experiments in which the transgene comprises 
unrearranged (V-D-J or V-J not rearranged) immunoglobulin 
genes have been variable, in some cases, producing incomplete 
or minimal rearrangement of the transgene. However, there are 
no published examples of either rearranged or unrearranged 
immunoglobulin transgenes which undergo successful isotype 
switching between C H genes within a transgene. 

The invention also provides a method for identifying 
candidate hybridomas which secrete a monoclonal antibody 
comprising a human immunoglobulin chain consisting essentially 
of a human VDJ sequence in polypeptide linkage to a human 
constant region sequence. Such candidate hybridomas are 



identified from a pool of hybridoma clones comprising: (1) 
hybridoma clones that express immunoglobulin chains consisting 
essentially of a human VDJ region and a human constant region, 
and (2) trans-switched hybridomas that express heterohybrid 
immunoglobulin chains consisting essentially of a human VDJ 
region and a murine constant region. The supernatant (s) of 
individual or pooled hybridoma clones is contacted with a 
predetermined antigen, typically an antigen which is 
immobilized by adsoption onto a solid substrate (e.g., a 
microtitre well) , under binding conditions to select 
antibodies having the predetermined antigen binding 
specificity. An antibody that specifically binds to human 
constant regions is also contacted with the hybridoma 
supernatant and predetermined antigen under binding conditions 
so that the antibody selectively binds to at least one human 
constant region epitope but substantially does not bind to 
murine constant region epitopes; thus forming complexes 
consisting essentially of hybridoma supernatant (transgenic 
monoclonal antibody) bound to a predetermined antigen and to 
an antibody that specifically binds human constant regions 
(and which may be labeled with a detectable label or 
reporter) . Detection of the formation of such complexes 
indicates hybridoma clones or pools which express a human 
immunog 1 obu 1 i n cha i n . 

In a preferred embodiment of the invention, the 
anti-human constant region immunoglobulin used in screening 
specifically recognizes a non-ju, non-5 isotype, preferably a a 
or e, more perferrably a y isotype constant region. 
Monoclonal antibodies of the y isotype are preferred (i) 
because the characteristics of IgG immunoglobulins are 
preferable to IgM immunogloblins for some therapeutic 
applications (e.g., due to the smaller size of the IgG dimers 
compared to IgM pentamers) and, (ii) because the process of 
somatic mutation is correlated with the class switch from the 
M constant region to the non-M (e.g., 7) constant regions. 
Immunoglobulins selected from the population of 
immunoglobulins that have undergone class switch (e.g., IgG) 
tend to bind antigen with higher affinity than immunoglobulins 
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selected from the population that has not undergone class 
switch (e.g., IgM) . See, e.g., Lonberg and Huszar. Intern. 
Rev. Immunol. 13:65-93 (1995) which is incorporated herein by 
reference. 

In one embodiment the candidate hybridomas are first 
screened for the 7 isotype constant region and the pool of 
IgG-expressing hybridomas is then screened for specific 
binding to the predetermined antigen. 

Thus, according to the method, a transgenic mouse of 
the invention is immunized with the predetermined antigen to 
induce an immune response. B cells are collected from the 
mouse and fused to immortal cells to produce hybridomas. The 
hybridomas are first screened to identify individual 
hybridomas secreting Ig of a non-mu, non-delta isotype (e..g., 
IgG) . This set of hybridomas is then screened for specific 
binding to the predetermined antigen of interest. Screening 
is carried out using standard techniques as described in, 
e.g., Harlow and Lane, Antibodies: A Laboratory Manual, Cold 
Spring Harbor, New York (1988) . Using this method it is 
possible to identify high-affinity immunoglobulins (e.g., Ka 
greater than about 10 7 M" 1 ) practically and efficiently. 

Definitions 

As used herein, the term "antibody" refers to a 
glycoprotein comprising at least two light polypeptide chains 
and two heavy polypeptide chains. Each of the heavy and light 
polypeptide chains contains a variable region (generally the 
amino terminal portion of the polypeptide chain) which 
contains a binding domain which interacts with antigen. Each 
of the heavy and light polypeptide chains also comprises a 
constant region of the polypeptide chains (generally the 
carboxyl terminal portion) which may mediate the binding of 
the immunoglobulin to host tissues or factors including 
various cells of the immune system, some phagocytic cells and 
the first component (Clq) of the classical complement system. 

As used herein, a "heterologous antibody" is defined 
in relation to the transgenic non-human organism producing 
such an antibody. It is defined as an antibody having an 



amino acid sequence or an encoding DNA sequence corresponding 
to that found in an organism not consisting of the transgenic 
non-human animal, and generally from a species other than that 
of the transgenic non-human animal. 

As used herein, a "heterohybrid antibody" refers to 
an antibody having a light and heavy chains of different 
organismal origins* For example, an antibody having a human 
heavy chain associated with a murine light chain is a 
heterohybrid antibody. 

As used herein, "isotype" refers to the antibody 
class (e.g., IgM or IgG x ) that is encoded by heavy chain 
constant region genes. 

As used herein, "isotype switching" refers to the 
phenomenon by which the class, or isotype, of an antibody 
changes from one Ig class to one of the other Ig classes. 

As used herein, "nonswitched isotype" refers to the 
isotypic class of heavy chain that is produced when no isotype 
switching has taken place; the C H gene encoding the 
nonswitched isotype is typically the first C H gene immediately 
downstream from the functionally rearranged VDJ gene. 

As used herein, the term "switch sequence" refers to 
those DNA sequences responsible for switch recombination. A 
"switch donor" sequence, typically a fx switch region, will be 
5» (i.e., upstream) of the construct region to be deleted 
during the switch recombination. The "switch acceptor" region 
will be between the construct region to be deleted and the 
replacement constant region (e.g., 7, e, etc.). As there is 
no specific site where recombination always occurs, the final 
gene sequence will typically not be predictable from the 
construct . 

As used herein, "glycosylation pattern" is defined 
as the pattern of carbohydrate units that are covalently 
attached to a protein, more specifically to an immunoglobulin 
protein. A glycosylation pattern of a heterologous antibody 
can be characterized as being substantially similar to 
glycosylation patterns which occur naturally on antibodies 
produced by the species of the nonhuman transgenic animal, 
when one of ordinary skill in the art would recognize the 



glycosylation pattern of the heterologous antibody as being 
more similar to said pattern of glycosylation in the species 
of the nonhuman transgenic animal than to the species from 
which the C H genes of the transgene were derived* 

As used herein, "specific binding" refers to the 
property of the antibody: (1) to bind to a predetermined 
antigen with an affinity of at least 1 x 10 7 M" 1 , and (2) to 
preferentially bind to the predetermined antigen with an 
affinity that is at least two-fold greater than its affinity 
for binding to a non-specific antigen (e.g., BSA, casein) 
other than the predetermined antigen or a closely-related 
antigen. 

The term "naturally-occurring" as used herein as 
applied to an object refers to the fact that an object can be 
found in nature. For example, a polypeptide or polynucleotide 
sequence that is present in an organism (including viruses) 
that can be isolated from a source in nature and which has not 
been intentionally modified by man in the laboratory is 
naturally-occurring . 

The term "rearranged" as used herein refers to a 
configuration of a heavy chain or light chain immunoglobulin 
locus wherein a V segment is positioned immediately adjacent 
to a D-J or J segment in a conformation encoding essentially a 
complete V H or V L domain, respectively. A rearranged 
immunoglobulin gene locus can be identified by comparison to 
germline DNA; a rearranged locus will have at least one 
recombined heptamer/nonamer homology element. 

The term "unrearranged" or "germline configuration" 
as used herein in reference to a V segment refers to the 
configuration wherein the V segment is not recombined so as to 
be immediately adjacent to a D or J segment. 

For nucleic acids, the term "substantial homology" 
indicates that two nucleic acids, or designated sequences 
thereof, when optimally aligned and compared, are identical, 
with appropriate nucleotide insertions or deletions, in at 
least about 80% of the nucleotides, usually at least about 90% 
to 95%, and more preferably at least about 98 to 99.5% of the 
nucleotides. Alternatively, substantial homology exists when 



the segments will hybridize under selective hybridization 
conditions, to the complement of the strand. The nucleic 
acids may be present in whole cells, in a cell lysate, or in a 
partially purified or substantially pure form* A nucleic acid 
is "isolated" or "rendered substantially pure" when purified 
away from other cellular components or other contaminants, 
e.g., other cellular nucleic acids or proteins, by standard 
techniques, including alkaline/ SDS treatment, CsCl banding, 
column chromatography, agarose gel electrophoresis and others 
well known in the art. See , F. Ausubel, et al., ed. Current 
Protocols in Molecular Biology , Greene Publishing and Wiley- 
Interscience, New York (1987) . 

The nucleic acid compositions of the present 
invention, while often in a native sequence (except for 
modified restriction sites and the like) , from either cDNA, 
genomic or mixtures may be mutated, thereof in accordance with 
standard techniques to provide gene sequences. For coding 
sequences, these mutations, may affect amino acid sequence as 
desired. In particular, DNA sequences substantially 
homologous to or derived from native V, D, J, constant, 
switches and other such sequences described herein are 
contemplated (where "derived" indicates that a sequence is 
identical or modified from another sequence) . 

A nucleic acid is "operably linked" when it is 
placed into a functional relationship with another nucleic 
acid sequence. For instance, a promoter or enhancer is 
operably linked to a coding sequence if it affects the 
transcription of the sequence. With respect to transcription 
regulatory sequences, operably linked means that the DNA 
sequences being linked are contiguous and, where necessary to 
join two protein coding regions, contiguous and in reading 
frame. For switch sequences, operably linked indicates that 
the sequences are capable of effecting switch recombination. 

Transgenic Nonhuman Animals Capable 
of Producing Heterologous Antibodies 

The design of a transgenic non-human animal that 

responds to foreign antigen stimulation with a heterologous 

antibody repertoire, requires that the heterologous 



immunoglobulin transgenes contained within the transgenic 
animal function correctly throughout the pathway of B-cell 
development. In a preferred embodiment, correct function of a 
heterologous heavy chain transgene includes isotype switching. 
Accordingly, the transgenes of the invention are constructed 
so as to produce isotype switching and one or more of the 
following: (1) high level and cell-type specific expression, 
(2) functional gene rearrangement, (3) activation of and 
response to allelic exclusion, (4) expression of a sufficient 
primary repertoire, (5) signal transduction, (6) somatic 
hy permutation, and (7) domination of the transgene antibody 
locus during the immune response. 

As will be apparent from the following disclosure, 
not all of the foregoing criteria need be met. For example, in 
those embodiments wherein the endogenous immunoglobulin loci 
of the transgenic animal are functionally disrupted, the 
transgene need not activate allelic exclusion. Further, in 
those embodiments wherein the transgene comprises a 
functionally rearranged heavy and/or light chain 
immunoglobulin gene, the second criteria of functional gene 
rearrangement is unnecessary, at least for that transgene 
which is already rearranged. For background on molecular 
immunology, see, Fundamental Immunology , 2nd edition (1989) , 
Paul William E . , ed. Raven Press, N.Y., which is incorporated 
herein by reference. 

In one aspect of the invention, transgenic non-human 
animals are provided that contain rearranged, unrearranged or 
a combination of rearranged and unrearranged heterologous 
immunoglobulin heavy and light chain transgenes in the 
germline of the transgenic animal. Each of the heavy chain 
transgenes comprises at least one C H gene. In addition, the 
heavy chain transgene may contain functional isotype switch 
sequences, which are capable of supporting isotype switching 
of a heterologous transgene encoding multiple C H genes in B- 
cells of the transgenic animal. Such switch sequences may be 
those which occur naturally in the germline immunoglobulin 
locus from the species that serves as the source of the 
transgene C H genes, or such switch sequences may be derived 



from those which occur in the species that is to receive the 
transgene construct (the transgenic animal) . For example, a 
human transgene construct that is used to produce a transgenic 
mouse may produce a higher frequency of isotype switching 
events if it incorporates switch sequences similar to those 
that occur naturally in the mouse heavy chain locus, as 
presumably the mouse switch sequences are optimized to 
function with the mouse switch recombinase enzyme system, 
whereas the human switch sequences are not. Switch sequences 
made be isolated and cloned by conventional cloning methods, 
or may be synthesized de novo from overlapping synthetic 
oligonucleotides designed on the basis of published sequence 
information relating to immunoglobulin switch region sequences 
(Mills et al., Nucl. Acids Res . 18:7305-7316 (1991); 
Sideras et al. , Intl. Immunol . 1:631-642 (1989), which are 
incorporated herein by reference) . 

For each of the foregoing transgenic animals, 
functionally rearranged heterologous heavy and light chain 
immunoglobulin transgenes are found in a significant fraction 
of the B-cells of the transgenic animal (at least 10 percent) . 

The transgenes of the invention include a heavy 
chain transgene comprising DNA encoding at least one variable 
gene segment, one diversity gene segment, one joining gene 
segment and at least one constant region gene segment. The 
immunoglobulin light chain transgene comprises DNA encoding at 
least one variable gene segment, one joining gene segment and 
at least one constant region gene segment. The gene segments 
encoding the light and heavy chain gene segments are 
heterologous to the transgenic non-human animal in that they 
are derived from, or correspond to, DNA encoding 
immunoglobulin heavy and light chain gene segments from a 
species not consisting of the transgenic non-human animal. In 
one aspect of the invention, the transgene is constructed such 
that the individual gene segments are unrearranged , i.e., not 
rearranged so as to encode a functional immunoglobulin light 
or heavy chain. Such unrearranged transgenes support 
recombination of the V, D, and J gene segments (functional 
rearrangement) and preferably support incorporation of all or 
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a portion of a D region gene segment in the resultant 
rearranged immunoglobulin heavy chain within the transgenic 
non-human animal when exposed to antigen. 

In an alternate embodiment, the transgenes comprise 
5 an unrearranged "mini-locus". Such transgenes typically 

comprise a substantial portion of the C, D, and J segments as 
well as a subset of the V gene segments. In such transgene 
constructs, the various regulatory sequences, e.g. promoters, 
enhancers, class switch regions, splice-donor and splice- 

10 acceptor sequences for RNA processing, recombination signals 
and the like, comprise corresponding sequences derived from 
the heterologous DNA. Such regulatory sequences may be 
incorporated into the transgene from the same or a related 
species of the non-human animal used in the invention. For 

15 example, human immunoglobulin gene segments may be combined in 
a transgene with a rodent immunoglobulin enhancer sequence for 
use in a transgenic mouse. Alternatively, synthetic regulatory 
sequences may be incorporated into the transgene, wherein such 
synthetic regulatory sequences are not homologous to a 

2 0 functional DNA sequence that is known to occur naturally in 

the genomes of mammals. Synthetic regulatory sequences are 
designed according to consensus rules, such as, for example, 
those specifying the permissible sequences of a splice- 
acceptor site or a promoter/ enhancer motif. For example, a 
25 minilocus comprises a portion of the genomic immunoglobulin 

locus having at least one internal (i.e., not at a terminus of 
the portion) deletion of a non-essential DNA portion (e.g., 
intervening sequence; intron or portion thereof) as compared 
to the naturally-occurring germline Ig locus. 

3 0 The invention also includes transgenic animals 

containing germ line cells having a heavy and light transgene 
wherein one of the said transgenes contains rearranged gene 
segments with the other containing unrearranged gene segments. 
In the preferred embodiments, the rearranged transgene is a 
3 5 light chain immunoglobulin transgene and the unrearranged 
transgene is a heavy chain immunoglobulin transgene. 

The Structure and Generation of Antibodies 



The basic structure of all immunoglobulins is based 
upon a unit consisting of two light polypeptide chains and two 
heavy polypeptide chains. Each light chain comprises two 
regions known as the variable light chain region and the 
constant light chain region. Similarly, the immunoglobulin 
heavy chain comprises two regions designated the variable 
heavy chain region and the constant heavy chain region. 

The constant region for the heavy or light chain is 
encoded by genomic sequences referred to as heavy or light 
constant region gene (C H ) segments. The use of a particular 
heavy chain gene segment defines the class of immunoglobulin. 
For example, in humans, the ju constant region gene segments 
define the IgM class of antibody whereas the use of a y, y2 , 
73 or 74 constant region gene segment defines the IgG class of 
antibodies as well as the IgG subclasses IgGl through IgG4 . 
Similarly, the use of a a x or <x 2 constant region gene segment 
defines the IgA class of antibodies as well as the subclasses 
IgAl and IgA2 . The S and e constant region gene segments 
define the IgD and IgE antibody classes, respectively. 

The variable regions of the heavy and light 
immunoglobulin chains together contain the antigen binding 
domain of the antibody. Because of the need for diversity in 
this region of the antibody to permit binding to a wide range 
of antigens, the DNA encoding the initial or primary 
repertoire variable region comprises a number of different DNA 
segments derived from families of specific variable region 
gene segments. In the case of the light chain variable 
region, such families comprise variable (V) gene segments and 
joining (J) gene segments. Thus, the initial variable region 
of the light chain is encoded by one V gene segment and one J 
gene segment each selected from the family of V and J gene 
segments contained in the genomic DNA of the organism. In the 
case of the heavy chain variable region, the DNA encoding the 
initial or primary repertoire variable region of the heavy 
chain comprises one heavy chain V gene segment, one heavy 
chain diversity (D) gene segment and one J gene segment, each 
selected from the appropriate V, D and J families of 
immunoglobulin gene segments in genomic DNA. 



36 

In order to increase the diversity of sequences that 
contribute to forming antibody binding sites, it is preferable 
that a heavy chain transgene include cis-acting sequences that 
support functional V-D-J rearrangement that can incorporate 
5 all or part of a D region gene sequence in a rearranged V-D-J 
gene sequence* Typically, at least about 1 percent of 
expressed transgene-encoded heavy chains (or mRNAs) include 
recognizable D region sequences in the V region. Preferably, 
at least about 10 percent of transgene-encoded V regions 
10 include recognizable D region sequences, more preferably at 
least about 3 0 percent, and most preferably more than 50 
percent include recognizable D region sequences. 

A recognizable D region sequence is generally at 
least about eight consecutive nucleotides corresponding to a 
15 sequence present in a D region gene segment of a heavy chain 
transgene and/ or the amino acid sequence encoded by such D 
region nucleotide sequence. For example, if a transgene 
includes the D region gene DHQ52, a transgene-encoded iciRNA 
containing the sequence 5 ■ -TAACTGGG-3 • located in the V region 
20 between a V gene segment sequence and a J gene segment 

sequence is recognizable as containing a D region sequence, 
specifically a DHQ52 sequence. Similarly, for example, if a 
transgene includes the D region gene DHQ52, a transgene- 
encoded heavy chain polypeptide containing the amino acid 
25 sequence -DAF- located in the V region between a V gene 

segment amino acid sequence and a J gene segment amino acid 
sequence may be recognizable as containing a D region 
sequence, specifically a DHQ52 sequence. However, since D 
region segments may be incorporated in VDJ joining to various 
3 0 extents and in various reading frames, a comparison of the D 
region area of a heavy chain variable region to the D region 
segments present in the transgene is necessary to determine 
the incorporation of particular D segments. Moreover, 
potential exonuclease digestion during recombination may lead 
3 5 to imprecise V-D and D-J joints during V-D-J recombination. 

However, because of somatic mutation and N-region 
addition, some D region sequences may be recognizable but may 
not correspond identically to a consecutive D region sequence 



in the transgene. For example, a nucleotide sequence 5»- 
CTAAXTGGGG-3 ' , where X is A, T, or G, and which is located in 
a heavy chain V region and flanked by a V region gene sequence 
and a J region gene sequence, can be recognized as 
corresponding to the DHQ52 sequence 5 • -CTAACTGGG- 3 ' . 
Similarly, for example, the polypeptide sequences -DAFDI-, 
-DYFDY-, or -GAFDI- located in a V region and flanked on the 
amino-terminal side by an amino acid sequence encoded by a 
transgene V gene sequence and flanked on the carboxy terminal 
side by an amino acid sequence encoded by a transgene J gene 
sequence is recognizable as a D region sequence. 

Therefore, because somatic mutation and N-region 
addition can produce mutations in sequences derived from a 
transgene D region, the following definition is provided as a 
guide for determining the presence of a recognizable D region 
sequence. An amino acid sequence or nucleotide sequence is 
recognizable as a D region sequence if: (1) the sequence is 
located in a V region and is flanked on one side by a V gene 
sequence (nucleotide sequence or deduced amino acid sequence) 
and on the other side by a J gene sequence (nucleotide 
sequence or deduced amino acid sequence) and (2) the sequence 
is substantially identical or substantially similar to a known 
D gene sequence (nucleotide sequence or encoded amino acid 
sequence) . 

The term "substantial identity" as used herein 
denotes a characteristic of a polypeptide sequence or nucleic 
acid sequence, wherein the polypeptide sequence has at least 
50 percent sequence identity compared to a reference sequence, 
often at least about 80% sequence identity and sometimes more 
0 than about 90% sequence identity, and the nucleic acid 

sequence has at least 70 percent sequence identity compared to 
a reference sequence. The percentage of sequence identity is 
calculated excluding small deletions or additions which total 
less than 35 percent of the reference sequence. The reference 
5 sequence may be a subset of a larger sequence, such as an 
entire D gene; however, the reference sequence is at least 8 
nucleotides long in the case of polynucleotides, and at least 
3 amino residues long in the case of a polypeptide. 
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Typically, the reference sequence is at least 8 to 12 
nucleotides or at least 3 to 4 amino acids, and preferably the 
reference sequence is 12 to 15 nucleotides or more, or at 
least 5 amino acids . 
5 The term "substantial similarity" denotes a 

characteristic of an polypeptide sequence, wherein the 
polypeptide sequence has at least 80 percent similarity to a 
reference sequence- The percentage of sequence similarity is 
calculated by scoring identical amino acids or positional 

10 conservative amino acid substitutions as similar. A 

positional conservative amino acid substitution is one that 
can result from a single nucleotide substitution; a first 
amino acid is replaced by a second amino acid where a codon 
for the first amino acid and a codon for the second amino acid 

15 can differ by a single nucleotide substitution. Thus, for 
example, the sequence -Lys-Glu-Arg-Val- is substantially 
similar to the sequence -Asn-Asp-Ser-Val- , since the codon 
sequence -AAA-GAA-AGA-GUU- can be mutated to -AAC-GAC-AGC-GUU- 
by introducing only 3 substitution mutations, single 

2 0 nucleotide substitutions in three of the four original codons. 

The reference sequence may be a subset of a larger sequence, 
such as an entire D gene; however, the reference sequence is 
at least 4 amino residues long. Typically, the reference 
sequence is at least 5 amino acids, and preferably the 
25 reference sequence is 6 amino acids or more. 

The Primary Repertoire 

The process for generating DNA encoding the heavy 
and light chain immunoglobulin genes occurs primarily in 

3 0 developing B-cells. Prior to the joining of various 

immunoglobulin gene segments, the V, D, J and constant (C) 
gene segments are found, for the most part, in clusters of V, 
D, J and C gene segments in the precursors of primary 
repertoire B-cells. Generally, all of the gene segments for a 
35 heavy or light chain are located in relatively close proximity 
on a single chromosome. Such genomic DNA prior to 
recombination of the various immunoglobulin gene segments is 
referred to herein as "unrearranged" genomic DNA. During 



B-cell differentiation, one of each of the appropriate family 
members of the V, D, J (or only V and J in the case of light 
chain genes) gene segments are recombined to form functionally 
rearranged heavy and light immunoglobulin genes. Such 
functional rearrangement is of the variable region segments to 
form DNA encoding a functional variable region. This gene 
segment rearrangement process appears to be sequential. 
First, heavy chain D-to-J joints are made, followed by heavy 
chain V-to-DJ joints and light chain V-to-J joints. The DNA 
encoding this initial form of a functional variable region in 
a light and/or heavy chain is referred to as "functionally 
rearranged DNA" or "rearranged DNA". In the case of the heavy 
chain, such DNA is referred to as "rearranged heavy chain DNA" 
and in the case of the light chain, such DNA is referred to as 
"rearranged light chain DNA" . Similar language is used to 
describe the functional rearrangement of the transgenes of the 
invention. 

The recombination of variable region gene segments 
to form functional heavy and light chain variable regions is 
mediated by recombination signal sequences (RSS's) that flank 
recombinationally competent V, D and J segments. RSS's 
necessary and sufficient to direct recombination, comprise a 
dyad-symmetric heptamer, an AT-rich nonamer and an intervening 
spacer region of either 12 or 23 base pairs. These signals 
are conserved among the different loci and species that carry 
out D-J (or V-J) recombination and are functionally 
interchangeable. See Oettinger, et al. (1990), Science, 248, 
1517-1523 and references cited therein. The heptamer 
comprises the sequence CACAGTG or its analogue followed by a 
spacer of unconserved sequence and then a nonamer having the 
sequence ACAAAAACC or its analogue. These sequences are found 
on the J, or downstream side, of each V and D gene segment. 
Immediately preceding the germline D and J segments are again 
two recombination signal sequences, first the nonamer and then 
the heptamer again separated by an unconserved sequence. The 
heptameric and nonameric sequences following a V L , V H or D 
segment are complementary to those preceding the J L , D or J H 
segments with which they recombine. The spacers between the 



heptameric and nonameric sequences are either 12 base pairs 
long or between 22 and 24 base pairs long* 

In addition to the rearrangement of V, D and J 
segments, further diversity is generated in the primary 
repertoire of immunoglobulin heavy and light chain by way of 
variable recombination between the V and J segments in the 
light chain and between the D and J segments of the heavy 
chain. Such variable recombination is generated by variation 
in the exact place at which such segments are joined. Such 
variation in the light chain typically occurs within the last 
codon of the V gene segment and the first codon of the J 
segment. Similar imprecision in joining occurs on the heavy 
chain chromosome between the D and J H segments and may extend 
over as many as 10 nucleotides. Furthermore, several 
nucleotides may be inserted between the D and J H and between 
the V H and D gene segments which are not encoded by genomic 
DNA. The addition of these nucleotides is known as N-region 
diversity. 

After VJ and/ or VDJ rearrangement, transcription of 
the rearranged variable region and one or more constant region 
gene segments located downstream from the rearranged variable 
region produces a primary RNA transcript which upon 
appropriate RNA splicing results in an mRNA which encodes a 
full length heavy or light immunoglobulin chain. Such heavy 
and light chains include a leader signal sequence to effect 
secretion through and/or insertion of the immunoglobulin into 
the transmembrane region of the B-cell. The DNA encoding this 
signal sequence is contained within the first exon of the V 
segment used to form the variable region of the heavy or light 
immunoglobulin chain. Appropriate regulatory sequences are 
also present in the mRNA to control translation of the mRNA to 
produce the encoded heavy and light immunoglobulin 
polypeptides which upon proper association with each other 
form an antibody molecule. 

The net effect of such rearrangements in the 
variable region gene segments and the variable recombination 
which may occur during such joining, is the production of a 
primary antibody repertoire. Generally, each B-cell which has 



differentiated to this stage, produces a single primary 
repertoire antibody. During this differentiation process, 
cellular events occur which suppress the functional 
rearrangement of gene segments other than those contained 
within the functionally rearranged Ig gene. The process by 
which diploid B-cells maintain such mono-specificity is termed 
allelic exclusion. 

The Secondary Repertoire 

B-cell clones expressing immunoglobulins from within 
the set of sequences comprising the primary repertoire are 
immediately available to respond to foreign antigens. Because 
of the limited diversity generated by simple VJ and VDJ 
joining, the antibodies produced by the so-called primary 
response are of relatively low affinity. Two different types 
of B-cells make up this initial response: precursors of 
primary antibody-forming cells and precursors of secondary 
repertoire B-cells (Linton et al. , Cell 59:1049-1059 (1989)). 
The first type of B-cell matures into IgM-secreting plasma 
cells in response to certain antigens. The other B-cells 
respond to initial exposure to antigen by entering a T-cell 
dependent maturation pathway. 

During the T-cell dependent maturation of antigen 
stimulated B-cell clones, the structure of the antibody 
molecule on the cell surface changes in two ways: the constant 
region switches to a non-IgM subtype and the sequence of the 
variable region can be modified by multiple single amino acid 
substitutions to produce a higher affinity antibody molecule. 

As previously indicated, each variable region of a 
heavy or light Ig chain contains an antigen binding domain. 
It has been determined by amino acid and nucleic acid 
sequencing that somatic mutation during the secondary response 
occurs throughout the V region including the three 
complementary determining regions (CDR1, CDR2 and CDR3) also 
referred to as hypervariable regions 1, 2 and 3 (Kabat et al. 
Sequences of Proteins of Immunological Interest (1991) U.S. 
Department of Health and Human Services, Washington, DC, 
incorporated herein by reference. The CDR1 and CDR2 are 



located within the variable gene segment whereas the CDR3 is 
largely the result of recombination between V and J gene 
segments or V, D and J gene segments. Those portions of the 
variable region which do not consist of CDR1, 2 or 3 are 
commonly referred to as framework regions designated FR1, FR2 , 
FR3 and FR4 . See Fig. 1. During hypermutation, the 
rearranged DNA is mutated to give rise to new clones with 
altered Ig molecules. Those clones with higher affinities for 
the foreign antigen are selectively expanded by helper 
T-cells, giving rise to affinity maturation of the expressed 
antibody. Clonal selection typically results in expression of 
clones containing new mutation within the CDR1, 2 and/or 3 
regions. However, mutations outside these regions also occur 
which influence the specificity and affinity of the antigen 
binding domain. 

Transgenic Non-Human Animals Capable 
of Producing Heterologous Antibody 

Transgenic non-human animals in one aspect of the 
invention are produced by introducing at least one of the 
immunoglobulin transgenes of the invention (discussed 
hereinafter) into a zygote or early embryo of a non-human 
animal. The non-human animals which are used in the invention 
generally comprise any mammal which is capable of rearranging 
immunoglobulin gene segments to produce a primary antibody 
response. Such nonhuman transgenic animals may include, for 
example, transgenic pigs, transgenic rats, transgenic rabbits, 
transgenic cattle, and other transgenic animal species, 
particularly mammalian species, known in the art. A 
particularly preferred non-human animal is the mouse or other 
members of the rodent family. 

However, the invention is not limited to the use of 
mice. Rather, any non-human mammal which is capable of 
mounting a primary and secondary antibody response may be 
used. Such animals include non-human primates, such as 
chimpanzee, bovine, ovine, and porcine species, other members 
of the rodent family, e.g. rat, as well as rabbit and guinea 
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pig. Particular preferred animals are mouse, rat, rabbit and 
guinea pig, most preferably mouse. 

In one embodiment of the invention, various gene 
segments from the human genome are used in heavy and light 
5 chain transgenes in an unrearranged form. In this embodiment, 
such transgenes are introduced into mice. The unrearranged 
gene segments of the light and/ or heavy chain transgene have 
DNA sequences unique to the human species which are 
distinguishable from the endogenous immunoglobulin gene 

10 segments in the mouse genome. They may be readily detected in 
unrearranged form in the germ line and somatic cells not 
consisting of B-cells and in rearranged form in B-cells. 

In an alternate embodiment of the invention, the 
transgenes comprise rearranged heavy and/or light 

15 immunoglobulin transgenes. Specific segments of such 

transgenes corresponding to functionally rearranged VDJ or VJ 
segments, contain immunoglobulin DNA sequences which are also 
clearly distinguishable from the endogenous immunoglobulin 
gene segments in the mouse. 

2 0 Such differences in DNA sequence are also reflected 

in the amino acid sequence encoded by such human 
immunoglobulin transgenes as compared to those encoded by 
mouse B-cells. Thus, human immunoglobulin amino acid 
sequences may be detected in the transgenic non-human animals 
25 of the invention with antibodies specific for immunoglobulin 
epitopes encoded by human immunoglobulin gene segments. 

Transgenic B-cells containing unrearranged 
transgenes from human or other species functionally recombine 
the appropriate gene segments to form functionally rearranged 

3 0 light and heavy chain variable regions. It will be readily 

apparent that the antibody encoded by such rearranged 
transgenes has a DNA and/ or amino acid sequence which is 
heterologous to that normally encountered in the nonhuman 
animal used to practice the invention. 

35 

Unrearranged Transgenes 

As used herein, an "unrearranged immunoglobulin 
heavy chain transgene" comprises DNA encoding at least one 



variable gene segment, one diversity gene segment, one joining 
gene segment and one constant region gene segment. Each of 
the gene segments of said heavy chain transgene are derived 
from, or has a sequence corresponding to, DNA encoding 
immunoglobulin heavy chain gene segments from a species not 
consisting of the non-human animal into which said transgene 
is introduced. Similarly, as used herein, an "unrearranged 
immunoglobulin light chain transgene" comprises DNA encoding 
at least one variable gene segment, one joining gene segment 
and at least one constant region gene segment wherein each 
gene segment of said light chain transgene is derived from, or 
has a sequence corresponding to, DNA encoding immunoglobulin 
light chain gene segments from a species not consisting of the 
non-human animal into which said light chain transgene is 
introduced. 

Such heavy and light chain transgenes in this aspect 
of the invention contain the above-identified gene segments in 
an unrearranged form. Thus, interposed between the V, D and J 
segments in the heavy chain transgene and between the V and J 
segments on the light chain transgene are appropriate 
recombination signal sequences (RSS's). In addition, such 
transgenes also include appropriate RNA splicing signals to 
join a constant region gene segment with the VJ or VDJ 
rearranged variable region. 

In order to facilitate isotype switching within a 
heavy chain transgene containing more than one C region gene 
segment, e.g. Cjit and C7I from the human genome, as explained 
below "switch regions" are incorporated upstream from each of 
the constant region gene segments and downstream from the 
variable region gene segments to permit recombination between 
such constant regions to allow for immunoglobulin class 
switching, e.g. from IgM to IgG. Such heavy and light 
immunoglobulin transgenes also contain transcription control 
sequences including promoter regions situated upstream from 
the variable region gene segments which typically contain TATA 
motifs. A promoter region can be defined approximately as a 
DNA sequence that, when operably linked to a downstream 
sequence, can produce transcription of the downstream 



sequence. Promoters may require the presence of additional 
linked cis-acting sequences in order to produce efficient 
transcription. In addition, other sequences that participate 
in the transcription of sterile transcripts are preferably 
included. Examples of sequences that participate in 
expression of sterile transcripts can be found in the 
published literature, including Rothman et al. , Intl. Immunol. 
2:621-627 (1990); Reid et al., Proc. Natl. Acad. Sci. USA 
86:840-844 (1989); Stavnezer et al. , Proc. Natl. Acad. Sci. 
USA 85:7704-7708 (1988); and Mills et al., Nucl. Acids Res. 
18:7305-7316 (1991), each of which is incorporated herein by 
reference. These sequences typically include about at least 
50 bp immediately upstream of a switch region, preferably 
about at least 2 00 bp upstream of a switch region; and more 
preferably about at least 200-1000 bp or more upstream of a 
switch region. Suitable sequences occur immediately upstream 
of the human S yl , S y2 , S y3f S y4 , S al , S^, and S e switch regions; 
the sequences immediately upstream of the human S yl , and S y3 
switch regions can be used to advantage, with S t1 generally 
preferred. Alternatively, or in combination, murine Ig switch 
sequences may be used; it may frequently be advantageous to 
employ Ig switch sequences of the same species as the 
transgenic non-human animal. Furthermore, interferon (IFN) 
inducible transcriptional regulatory elements, such as IFN- 
inducible enhancers, are preferably included immediately 
upstream of transgene switch sequences. 

In addition to promoters, other regulatory sequences 
which function primarily in B-lineage cells are used. Thus, 
for example, a light chain enhancer sequence situated 
preferably between the J and constant region gene segments on 
the light chain transgene is used to enhance transgene 
expression, thereby facilitating allelic exclusion. In the 
case of the heavy chain transgene, regulatory enhancers and 
also employed. Such regulatory sequences are used to maximize 
the transcription and translation of the transgene so as to 
induce allelic exclusion and to provide relatively high levels 
of transgene expression. 
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Although the foregoing promoter and enhancer 
regulatory control sequences have been generically described, 
such regulatory sequences may be heterologous to the nonhuman 
animal being derived from the genomic DNA from which the 
5 heterologous transgene immunoglobulin gene segments are 
obtained. Alternately, such regulatory gene segments are 
derived from the corresponding regulatory sequences in the 
genome of the non-human animal, or closely related species, 
which contains the heavy and light transgene. 

10 In the preferred embodiments, gene segments are 

derived from human beings. The transgenic non-human animals 
harboring such heavy and light transgenes are capable of 
mounting an Ig-mediated immune response to a specific antigen 
administered to such an animal. B-cells are produced within 

15 such an animal which are capable of producing heterologous 

human antibody. After immortalization, and the selection for 
an appropriate monoclonal antibody (Mab) , e.g. a hybridoma, a 
source of therapeutic human monoclonal antibody is provided. 
Such human Mabs have significantly reduced immunogenicity when 

2 0 therapeutically administered to humans. 

Although the preferred embodiments disclose the 
construction of heavy and light transgenes containing human 
gene segments, the invention is not so limited. In this 
regard, it is to be understood that the teachings described 
25 herein may be readily adapted to utilize immunoglobulin gene 
segments from a species other than human beings. For example, 
in addition to the therapeutic treatment of humans with the 
antibodies of the invention, therapeutic antibodies encoded by 
appropriate gene segments may be utilized to generate 

3 0 monoclonal antibodies for use in the veterinary sciences. 

Rearranged Transgenes 

In an alternative embodiment, transgenic nonhuman 
animals contain functionally at least one rearranged 
3 5 heterologous heavy chain immunoglobulin transgene in the 
germline of the transgenic animal. Such animals contain 
primary repertoire B-cells that express such rearranged heavy 
transgenes. Such B-cells preferably are capable of undergoing 
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somatic mutation when contacted with an antigen to form a 
heterologous antibody having high affinity and specificity for 
the antigen. Said rearranged transgenes will contain at least 
two C H genes and the associated sequences required for isotype 
5 switching. 

The invention also includes transgenic animals 
containing germ line cells having heavy and light transgenes 
wherein one of the said transgenes contains rearranged gene 
segments with the other containing unrearranged gene segments. 

10 In such animals, the heavy chain transgenes shall have at 

least two C H genes and the associated sequences required for 
isotype switching . 

The invention further includes methods for 
generating a synthetic variable region gene segment repertoire 

15 to be used in the transgenes of the invention. The method 

comprises generating a population of immunoglobulin V segment 
DNAs wherein each of the V segment DNAs encodes an 
immunoglobulin V segment and contains at each end a cleavage 
recognition site of a restriction endonuclease. The 

2 0 population of immunoglobulin V segment DNAs is thereafter 

concatenated to form the synthetic immunoglobulin V segment 
repertoire. Such synthetic variable region heavy chain 
transgenes shall have at least two C H genes and the associated 
sequences required for isotype switching. 

25 

Isotype Switching 

In the development of a B lymphocyte, the cell 
initially produces IgM with a binding specificity determined 
by the productively rearranged V H and V L regions. 
30 Subsequently, each B cell and its progeny cells synthesize 

antibodies with the same L and H chain V regions, but they may 
switch the isotype of the H chain. 

The use of \i or S constant regions is largely 
determined by alternate splicing, permitting IgM and IgD to be 

3 5 coexpressed in a single cell. The other heavy chain isotypes 

(7, a, and e) are only expressed natively after a gene 
rearrangement event deletes the Cfi and CS exons. This gene 
rearrangement process, termed isotype switching, typically 
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occurs by recombination between so called switch segments 
located immediately upstream of each heavy chain gene (except 
6) . The individual switch segments are between 2 and 10 kb in 
length, and consist primarily of short repeated sequences. 
5 The exact point of recombination differs for individual class 
switching events. Investigations which have used solution 
hybridization kinetics or Southern blotting with cDNA-derived 
C H probes have confirmed that switching can be associated with 
loss of C H sequences from the cell. 

10 The switch ( S ) region of the /i gene , S^ , is located 

about 1 to 2 kb 5 1 to the coding sequence and is composed of 
numerous tandem repeats of sequences of the form 
(GAGCT) n (GGGGT) , where n is usually 2 to 5 but can range as 
high as 17. ( See T. Nikaido et al. Nature 292 : 845-848 (1981)) 

15 Similar internally repetitive switch sequences 

spanning several kilobases have been found 5 1 of the other C H 
genes. The S a region has been sequenced and found to consist 
of tandemly repeated 80-bp homology units, whereas murine S y2a , 
S y2b , and S y3 all contain repeated 49-bp homology units very 

20 similar to each other. ( See . P. Szurek et al. , J. Immunol 
135 : 620-626 (1985) and T. Nikaido et al., J. Biol. Chem. 
257 : 7322-7329 (1982), which are incorporated herein by 
reference.) All the sequenced S regions include numerous 
occurrences of the pentamers GAGCT and GGGGT that are the 

25 basic repeated elements of the S p gene (T. Nikaido et al., 

Biol. Chem. 257 : 7322-7329 (1982) which is incorporated herein 
by reference) ; in the other S regions these pentamers are not 
precisely tandemly repeated as in S^, but instead are embedded 
in larger repeat units. The S yl region has an additional 

3 0 higher-order structure: two direct repeat sequences flank 
each of two clusters of 49-bp tandem repeats. ( See M. R. 
Mowatt et al., J. Immunol. 136 ; 2674-2683 (1986), which is 
incorporated herein by reference) . 

Switch regions of human H chain genes have been 

35 found to be very similar to their mouse homologs. Indeed, 

similarity between pairs of human and mouse clones 5 1 to the 
C H genes has been found to be confined to the S regions, a fact 
that confirms the biological significance of these regions. 
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A switch recombination between \x and a genes 
produces a composite S^-S a sequence. Typically, there is no 
specific site, either in or in any other S region, where 
the recombination always occurs. 
5 Generally, unlike the enzymatic machinery of V-J 

recombination, the switch machinery can apparently accommodate 
different alignments of the repeated homologous regions of 
germline S precursors and then join the sequences at different 
positions within the alignment. ( See , T. H. Rabbits et al., 

10 Nucleic Acids Res . 9:4509-4524 (1981) and J. Ravetch et al. , 
Proc. Natl. Acad. Sci. USA 77:6734-6738 (1980), which are 
incorporated herein by reference.) 

The exact details of the mechanism (s) of selective 
activation of switching to a particular isotype are unknown. 

15 Although exogenous influences such as lymphokines and 

cytokines might upregulate isotype-specif ic recombinases , it 
is also possible that the same enzymatic machinery catalyzes 
switches to all isotypes and that specificity lies in 
targeting this machinery to specific switch regions. 

20 The T-cell-derived lymphokines IL-4 and IFN y have 

been shown to specifically promote the expression of certain 
isotypes: in the mouse, IL-4 decreases IgM, IgG2a, IgG2b, and 
IgG3 expression and increases IgE and IgGl expression; while 
IFN y selectively stimulates IgG2a expression and antagonizes 

25 the IL-4-induced increase in IgE and IgGl expression (Coffman 
et al., J. Immunol . 136: 949 (1986) and Snapper et al., 
Science 236 : 944 (1987), which are incorporated herein by 
reference) . A combination of IL-4 and IL-5 promotes IgA 
expression (Coffman et al., J. Immunol . 139 : 3685 (1987), 

3 0 which is incorporated herein by reference) . 

Most of the experiments implicating T-cell effects 
on switching have not ruled out the possibility that the 
observed increase in cells with particular switch 
recombinations might reflect selection of preswitched or 

3 5 precommitted cells; but the most likely explanation is that 
the lymphokines actually promote switch recombination. 

Induction of class switching appears to be 
associated with sterile transcripts that initiate upstream of 
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the switch segments (Lutzker et al., Mol. Cell. Biol , 8:1849 
(1988); Stavnezer et al. f Proc. Natl. Acad. Sci. USA 85 : 7704 
(1988); Esser and Radbruch, EMBO J . 8:483 (1989); Berton et 
al., Proc. Natl. Acad. Sci. USA 86:2829 (1989); Rothman et 
5 al., Int. Immunol . .2:621 (1990), each of which is incorporated 
by reference) . For example, the observed induction of the 7I 
sterile transcript by IL-4 and inhibition by IFN-7 correlates 
with the observation that IL-4 promotes class switching to 7I 
in B-cells in culture, while IFN-7 inhibits 7I expression. 

10 Therefore, the inclusion of regulatory sequences that affect 
the transcription of sterile transcripts may also affect the 
rate of isotype switching. For example, increasing the 
transcription of a particular sterile transcript typically can 
be expected to enhance the frequency of isotype switch 

15 recombination involving adjacent switch sequences. 

For these reasons, it is preferable that transgenes 
incorporate transcriptional regulatory sequences within about 
1-2 kb upstream of each switch region that is to be utilized 
for isotype switching. These transcriptional regulatory 

2 0 sequences preferably include a promoter and an enhancer 

element, and more preferably include the 5 1 flanking (i.e., 
upstream) region that is naturally associated (i.e., occurs in 
germline configuration) with a switch region. This 5' 
flanking region is typically about at least 50 nucleotides in 
25 length, preferably about at least 200 nucleotides in length, 
and more preferably at least 500-1000 nucleotides. 

Although a 5 1 flanking sequence from one switch 
region can be operably linked to a different switch region for 
transgene construction (e.g., a 5' flanking sequence from the 

3 0 human S yl switch can be grafted immediately upstream of the S al 

switch; a murine S yl flanking region can be grafted adjacent to 
a human 71 switch sequence; or the murine S yl switch can be 
grafted onto the human 7I coding region) , in some embodiments 
it is preferred that each switch region incorporated in the 
35 transgene construct have the 5 1 flanking region that occurs 
immediately upstream in the naturally occurring germline 
configuration . 
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Monoclonal Antibodies 

Monoclonal antibodies can be obtained by various 
techniques familiar to those skilled in the art* Briefly, 
spleen cells from an animal immunized with a desired antigen 
5 are immortalized, commonly by fusion with a myeloma cell ( see , 
Kohler and Milstein, Eur. J. Immunol. , 6:511-519 (1976)). 
Alternative methods of immortalization include transformation 
with Epstein Barr Virus, oncogenes, or retroviruses, or other 
methods well known in the art. Colonies arising from single 

10 immortalized cells are screened for production of antibodies 
of the desired specificity and affinity for the antigen, and 
yield of the monoclonal antibodies produced by such cells may 
be enhanced by various techniques, including injection into 
the peritoneal cavity of a vertebrate host. Various 

15 techniques useful in these arts are discussed, for example, in 
Harlow and Lane, Antibodies: A Laboratory Manual , Cold Spring 
Harbor, New York (1988) including: immunization of animals to 
produce immunoglobulins; production of monoclonal antibodies; 
labeling immunoglobulins for use as probes; immunoaf f inity 

20 purification; and immunoassays. 

The Transgenic Primary Repertoire 
A. The Human Immunoglobulin Loci 

An important requirement for transgene function is 

2 5 the generation of a primary antibody repertoire that is 

diverse enough to trigger a secondary immune response for a 
wide range of antigens. The rearranged heavy chain gene 
consists of a signal peptide exon, a variable region exon and 
a tandem array of multi-domain constant region regions, each 

3 0 of which is encoded by several exons. Each of the constant 

region genes encode the constant portion of a different class 
of immunoglobulins. During B-cell development, V region 
proximal constant regions are deleted leading to the 
expression of new heavy chain classes. For each heavy chain 
3 5 class, alternative patterns of RNA splicing give rise to both 
transmembrane and secreted immunoglobulins. 

The human heavy chain locus is estimated to consist 
of approximately 200 V gene segments (current data supports 
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the existence of about 50-100 V gene segments) spanning 2 Mb, 
approximately 30 D gene segments spanning about 4 0 kb, six J 
segments clustered within a 3 kb span, and nine constant 
region gene segments spread out over approximately 3 00 kb. 
5 The entire locus spans approximately 2.5 Mb of the distal 
portion of the long arm of chromosome 14. 

B. Gene Fragment Transaenes 

1. Heavy Chain Transgene 

10 In a preferred embodiment, immunoglobulin heavy and 

light chain transgenes comprise unrearranged genomic DNA from 
humans. In the case of the heavy chain, a preferred transgene 
comprises a NotI fragment having a length between 67 0 to 83 0 
kb. The length of this fragment is ambiguous because the 3 1 

15 restriction site has not been accurately mapped. It is known, 
however, to reside between the al and \(/a gene segments. This 
fragment contains members of all six of the known V H families, 
the D and J gene segments, as well as the ji , 8, y3 , yl and al 
constant regions (Berman et al. , EMBO J . 7:727-738 (1988), 

2 0 which is incorporated herein by reference) . A transgenic 
mouse line containing this transgene correctly expresses a 
heavy chain class required for B-cell development (IgM) and at 
least one switched heavy chain class (IgG x ) , in conjunction 
with a sufficiently large repertoire of variable regions to 

25 trigger a secondary response for most antigens. 

2 . Light Chain Transgene 

A genomic fragment containing all of the necessary 
gene segments and regulatory sequences from a human light 
30 chain locus may be similarly constructed. Such transgenes are 
constructed as described in the Examples and in copending 
application, entitled "Transgenic Non-Human Animals Capable 
of Producing Heterologous Antibodies," filed August 29, 1990, 
under U.S. S.N. 07/574,748. 



35 



Transgenes Generated Intracellular ly 
bv In Vivo Recombination 
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It is not necessary to isolate the all or part of 
the heavy chain locus on a single DNA fragment. Thus, for 
example, the 670-83 0 kb NotI fragment from the human 
immunoglobulin heavy chain locus may be formed in vivo in the 
5 non-human animal during transgenesis . Such in vivo transgene 
construction is produced by introducing two or more 
overlapping DNA fragments into an embryonic nucleus of the 
non-human animal. The overlapping portions of the DNA 
fragments have DNA sequences which are substantially 

10 homologous. Upon exposure to the recombinases contained 

within the embryonic nucleus, the overlapping DNA fragments 
homologously recombined in proper orientation to form the 
670-830 kb NotI heavy chain fragment. 

In vivo transgene construction can be used to form 

15 any number of immunoglobulin transgenes which because of their 
size are otherwise difficult, or impossible, to make or 
manipulate by present technology. Thus, in vivo transgene 
construction is useful to generate immunoglobulin transgenes 
which are larger than DNA fragments which may be manipulated 

20 by YAC vectors (Murray and Szostak, Nature 305 : 189-193 

(1983)). Such in vivo transgene construction may be used to 
introduce into a non-human animal substantially the entire 
immunoglobulin loci from a species not consisting of the 
transgenic non-human animal . 

25 In addition to forming genomic immunoglobulin 

transgenes, in vivo homologous recombination may also be 
utilized to form "mini-locus" transgenes as described in the 
Examples . 

In the preferred embodiments utilizing in vivo 
3 0 transgene construction, each overlapping DNA fragment 

preferably has an overlapping substantially homologous DNA 
sequence between the end portion of one DNA fragment and the 
end portion of a second DNA fragment. Such overlapping 
portions of the DNA fragments preferably comprise about 500 bp 
3 5 to about 2000 bp, most preferably 1.0 kb to 2.0 kb. Homologous 
recombination of overlapping DNA fragments to form transgenes 
in vivo is further described in commonly assigned U.S. Patent 
Application entitled "Intracellular Generation of DNA by 
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Homologous Recombination of DNA Fragments" filed August 29, 
1990, under U. S.S.N. 07/574,747. 



D. Minilocus Transaenes 
5 As used herein, the term "immunoglobulin minilocus" 

refers to a DNA sequence (which may be within a longer 
sequence) , usually of less than about 150 kb, typically 
between about 2 5 and 100 kb, containing at least one each of 
the following: a functional variable (V) gene segment, a 

10 functional joining (J) region segment, at least one functional 
constant (C) region gene segment, and — if it is a heavy chain 
minilocus — a functional diversity (D) region segment, such 
that said DNA sequence contains at least one substantial 
discontinuity (e.g. , a deletion, usually of at least about 2 

15 to 5 kb, preferably 10-25 kb or more, relative to the 

homologous genomic DNA sequence) . A light chain minilocus 
transgene will be at least 25 kb in length, typically 50 to 60 
kb. A heavy chain transgene will typically be about 70 to 80 
kb in length, preferably at least about 60 kb with two 

2 0 constant regions operably linked to switch regions. 

Furthermore , the individual elements of the minilocus are 
preferably in the germline configuration and capable of 
undergoing gene rearrangement in the pre-B cell of a 
transgenic animal so as to express functional antibody 

2 5 molecules with diverse antigen specificities encoded entirely 

by the elements of the minilocus. Further, a heavy chain 
minilocus comprising at least two C H genes and the requisite 
switching sequences is typically capable of undergoing isotype 
switching, so that functional antibody molecules of different 

3 0 immunoglobulin classes will be generated. Such isotype 

switching may occur in vivo in B-cells residing within the 
transgenic nonhuman animal, or may occur in cultured cells of 
the B-cell lineage which have been explanted from the 
transgenic nonhuman animal. 
35 In an alternate preferred embodiment, immunoglobulin 

heavy chain transgenes comprise one or more of each of the V H , 
D, and J H gene segments and two or more of the C H genes. At 
least one of each appropriate type gene segment is 
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incorporated into the minilocus transgene. With regard to the 
C H segments for the heavy chain transgene, it is preferred 
that the transgene contain at least one \i gene segment and at 
least one other constant region gene segment, more preferably 
5 a 7 gene segment, and most preferably 73 or 7I. This 

preference is to allow for class switching between IgM and IgG 
forms of the encoded immunoglobulin and the production of a 
secretable form of high affinity non-IgM immunoglobulin. 
Other constant region gene segments may also be used such as 

10 those which encode for the production of IgD, IgA and IgE. 

Those skilled in the art will also construct 
transgenes wherein the order of occurrence of heavy chain C H 
genes will be different from the naturally-occurring spatial 
order found in the germline of the species serving as the 

15 donor of the C H genes. 

Additionally, those skilled in the art can select C H 
genes from more than one individual of a species (e.g., 
allogeneic C H genes) and incorporate said genes in the 
transgene as supernumerary C H genes capable of undergoing 

20 isotype switching; the resultant transgenic nonhuman animal 
may then, in some embodiments, make antibodies of various 
classes including all of the allotypes represented in the 
species from which the transgene C H genes were obtained. 

Still further, those skilled in the art can select 

25 C H genes from different species to incorporate into the 

transgene. Functional switch sequences are included with each 
C H gene, although the switch sequences used are not 
necessarily those which occur naturally adjacent to the C H 
gene. Interspecies C H gene combinations will produce a 

3 0 transgenic nonhuman animal which may produce antibodies of 
various classes corresponding to C H genes from various 
species. Transgenic nonhuman animals containing interspecies 
C H transgenes may serve as the source of B-cells for 
constructing hybridomas to produce monoclonals for veterinary 

3 5 uses. 

The heavy chain J region segments in the human 
comprise six functional J segments and three pseudo genes 
clustered in a 3 kb stretch of DNA. Given its relatively 
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compact size and the ability to isolate these segments 
together with the \i gene and the 5 ' portion of the 8 gene on a 
single 23 kb SFil/Spel fragment (Sado et al., Biochem. 
Biophys. Res, Comm. 154 : 264271 (1988), which is incorporated 
5 herein by reference) , it is preferred that all of the J region 
gene segments be used in the mini-locus construct. Since this 
fragment spans the region between the jjl and S genes, it is 
likely to contain all of the 3 1 cis-linked regulatory elements 
required for jit expression. Furthermore, because this fragment 

10 includes the entire J region, it contains the heavy chain 

enhancer and the /x switch region (Mills et al., Nature 306 :809 
(1983); Yancopoulos and Alt, Ann . Rev. Immunol . 4:339-368 
(1986) , which are incorporated herein by reference) . It also 
contains the transcription start sites which trigger VDJ 

15 joining to form primary repertoire B-cells (Yancopoulos and 
Alt, Cell 40:271-281 (1985), which is incorporated herein by 
reference) . Alternatively, a 36 kb BssHII/Spell fragment, 
which includes part on the D region, may be used in place of 
the 23 kb Sfil/ Spell fragment. The use of such a fragment 

20 increases the amount of 5* flanking sequence to facilitate 
efficient D-to-J joining. 

The human D region consists of 4 homologous 9 kb 
subregions, linked in tandem (Siebenlist, et al. (1981), 
Nature , 294 , 631-635) . Each subregion contains up to 10 

25 individual D segments. Some of these segments have been 

mapped and are shown in Fig. 4. Two different strategies are 
used to generate a mini-locus D region. The first strategy 
involves using only those D segments located in a short 
contiguous stretch of DNA that includes one or two of the 

3 0 repeated D subregions. A candidate is a single 15 kb fragment 
that contains 12 individual D segments. This piece of DNA 
consists of 2 contiguous EcoRI fragments and has been 
completely sequenced (Ichihara, et al. (1988) , EMBO J * , Z, 
4141-4150) . Twelve D segments should be sufficient for a 

3 5 primary repertoire. However, given the dispersed nature of 
the D region, an alternative strategy is to ligate together 
several non-contiguous D-segment containing fragments, to 
produce a smaller piece of DNA with a greater number of 
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segments. Additional D-segment genes can be identified, for 
example, by the presence of characteristic flanking nonamer 
and heptamer sequences, supra, and by reference to the 
literature. 

5 At least one, and preferably more than one V gene 

segment is used to construct the heavy chain minilocus 
transgene. Rearranged or unrearranged V segments with or 
without flanking sequences can be isolated as described in 
copending applications, U.S. S.N. 07/574,748 filed August 29, 

10 1990, PCT/US91/06185 filed August 28, 1991, and U.S. S.N. 
07/810,279 filed December 17, 1991, each of which is 
incorporated herein by reference. 

Rearranged or unrearranged V segments, D segments, J 
segments, and C genes, with or without flanking sequences, can 

15 be isolated as described in copending applications U.S. S.N. 
07/574,748 filed August 29, 1990 and PCT/US91/06185 filed 
August 28, 1991. 

A minilocus light chain transgene may be similarly 
constructed from the human X or k immunoglobulin locus. 

2 0 Thus, for example, an immunoglobulin heavy chain minilocus 

transgene construct, e.g., of about 75 kb, encoding V, D, J 
and constant region sequences can be formed from a plurality 
of DNA fragments, with each sequence being substantially 
homologous to human gene sequences. Preferably, the sequences 

25 are operably linked to transcription regulatory sequences and 
are capable of undergoing rearrangement. With two or more 
appropriately placed constant region sequences (e.g., ji and 7) 
and switch regions, switch recombination also occurs. An 
exemplary light chain transgene construct can be formed 

30 similarly from a plurality of DNA fragments, substantially 
homologous to human DNA and capable of undergoing 
rearrangement, as described in copending application, U.S. S.N. 
07/574,748 filed August 29, 1990. 

3 5 E. Transgene Constructs Capable of Isotype Switching 

Ideally, transgene constructs that are intended to 
undergo class switching should include all of the cis-acting 
sequences necessary to regulate sterile transcripts. 
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Naturally occurring switch regions and upstream promoters and 
regulatory sequences (e.g., IFN-inducible elements) are 
preferred cis-acting sequences that are included in transgene 
constructs capable of isotype switching. About at least 50 
5 basepairs, preferably about at least 200 basepairs, and more 
preferably at least 500 to 1000 basepairs or more of sequence 
immediately upstream of a switch region, preferably a human 71 
switch region, should be operably linked to a switch sequence, 
preferably a human 7I switch sequence. Further, switch 

10 regions can be linked upstream of (and adjacent to) C H genes 
that do not naturally occur next to the particular switch 
region. For example, but not for limitation, a human y ± 
switch region may be linked upstream from a human a 2 C H gene, 
or a murine y x switch may be linked to a human C H gene. 

15 An alternative method for obtaining non-classical 

isotype switching (e.g., ^-associated deletion) in transgenic 
mice involves the inclusion of the 400 bp direct repeat 
sequences (ay, and e/x) that flank the human & gene (Yasui et 
al., Eur. J. Immunol. 19 :13 99 (1989)). Homologous 

2 0 recombination between these two sequences deletes the /x gene 

in IgD-only B-cells. Heavy chain transgenes can be 
represented by the following formulaic description: 

(V H ) x -(D) y -(J H ) B -(S D ) m -(C 1 ) n -[(T)-(S A ) p -(C 2 ) ] q 

25 

where : 

V H is a heavy chain variable region gene segment, 

D is a heavy chain D (diversity) region gene segment, 

J H is a heavy chain J (joining) region gene segment, 

3 0 S D is a donor region segment capable of participating in 

a recombination event with the S a acceptor region 

segments such that isotype switching occurs, 

C x is a heavy chain constant region gene segment encoding 

an isotype utilized in for B cell development (e.g., 
35 fJL or 8) , 

T is a cis-acting transcriptional regulatory region 

segment containing at least a promoter, 
S A is an acceptor region segment capable of participating 



in a recombination event with selected S D donor 
region segments, such that isotype switching occurs, 
C 2 is a heavy chain constant region gene segment encoding 
an isotype other than /j (e.g., y lf y 2 , 73/ 74/ a ir 
a 2 , e ) . 

x, y, z, m, n, p, and q are integers. x is 1-100, n is 
0-10, y is 1-50, p is 1-10, z is 1-50, q is 0-50, m 
is 0-10. Typically, when the transgene is capable 
of isotype switching, q must be at least 1, m is at 
least 1, n is at least 1, and m is greater than or 
equal to n. 

V H' D ' j h/ S D' c l' T ' s a/ and c z segments may be 
selected from various species, preferably mammalian species, 
and more preferably from human and murine germline DNA. 

V H segments may be selected from various species, 
but are preferably selected from V H segments that occur 
naturally in the human germline, such as V H251 . Typically 
about 2 V H gene segments are included, preferably about 4 V H 
segments are included, and most preferably at least about 10 
V H segments are included. 

At least one D segment is typically included, 
although at least 10 D segments are preferably included, and 
some embodiments include more than ten D segments. Some 
preferred embodiments include human D segments. 

Typically at least one J H segment is incorporated in 
the transgene, although it is preferable to include about six 
J H segments, and some preferred embodiments include more than 
about six J H segments. Some preferred embodiments include 
human J H segments, and further preferred embodiments include 
six human J H segments and no nonhuman J H segments. 

S D segments are donor regions capable of 
participating in recombination events with the S A segment of 
the transgene. For classical isotype switching, S D and S A are 
switch regions such as S M , S Yl , S y2 , S y3f S y4 , S a , S a2 / and S e . 
Preferably the switch regions are murine or human, more 
preferably S D is a human or murine S^ and S A is a human or 
murine S vl . For nonclassical isotype switching ( ^-associated 
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deletion) , S D and S A are preferably the 400 basepair direct 
repeat sequences that flank the human fi gene. 

C ± segments are typically /i or <S genes, preferably a 
fj. gene, and more preferably a human or murine /x gene. 

T segments typically include S 1 flanking sequences 
that are adjacent to naturally occurring (i.e., germline) 
switch regions. T segments typically at least about at least 
50 nucleotides in length, preferably about at least 200 
nucleotides in length, and more preferably at least 500-1000 
nucleotides in length. Preferably T segments are 5' flanking 
sequences that occur immediately upstream of human or murine 
switch regions in a germline configuration. It is also 
evident to those of skill in the art that T segments may 
comprise cis-acting transcriptional regulatory sequences that 
do not occur naturally in an animal germline (e.g., viral 
enhancers and promoters such as those found in SV40, 
adenovirus, and other viruses that infect eukaryotic cells) . 

C 2 segments are typically a y lr y 2 , y 3 , 7 4 , a x , a 2 , 
0^ € C H gene/ pre ferably a human c« of these isotypes , and 

more preferably a human y x or 73 gene. Murine 7 2a and 7 2b may 
also be used, as may downstream (i.e., switched) isotype genes 
form various species. Where the heavy chain transgene 
contains an immunoglobulin heavy chain minilocus, the total 
length of the transgene will be typically 150 kilo basepairs 
or less. 

In general, the transgene will be other than a 
native heavy chain Ig locus. Thus, for example, deletion of 
unnecessary regions or substitutions with corresponding 
regions from other species will be present. 

F. Methods for Determining Functional 
Isotype Switching in la Transaenes 

The occurrence of isotype switching in a transgenic 
nonhuman animal may be identified by any method known to those 
in the art. Preferred embodiments include the following, 
employed either singly or in combination: 

1. detection of mRNA transcripts that contain a sequence 
homologous to at least one transgene downstream C H gene other 



than S and an adjacent sequence homologous to a transgene V H - 
D H -J H rearranged gene; such detection may be by Northern 
hybridization, S x nuclease protection assays, PCR 
amplification, cDNA cloning, or other methods; 

2. detection in the serum of the transgenic animal, or in 
supernatants of cultures of hybridoma cells made from B-cells 
of the transgenic animal, of immunoglobulin proteins encoded 
by downstream C H genes, where such proteins can also be shown 
by immunochemical methods to comprise a functional variable 
region; 

3. detection, in DNA from B-cells of the transgenic 
animal or in genomic DNA from hybridoma cells, of DNA 
rearrangements consistent with the occurrence of isotype 
switching in the transgene, such detection may be accomplished 
by Southern blot hybridization, PCR amplification, genomic 
cloning, or other method; or 

4. identification of other indicia of isotype switching, 
such as production of sterile transcripts, production of 
characteristic enzymes involved in switching (e.g., "switch 
recombinase") , or other manifestations that may be detected, 
measured, or observed by contemporary techniques. 

Because each transgenic line may represent a 
different site of integration of the transgene, and a 
potentially different tandem array of transgene inserts, and 
because each different configuration of transgene and flanking 
DNA sequences can affect gene expression, it is preferable to 
identify and use lines of mice that express high levels of 
human immunoglobulins, particularly of the IgG isotype, and 
contain the least number of copies of the transgene. Single 
copy transgenics minimize the potential problem of incomplete 
allelic expression. Transgenes are typically integrated into 
host chromosomal DNA, most usually into germline DNA and 
propagated by subsequent breeding of germline transgenic 
breeding stock animals. However, other vectors and transgenic 
methods known in the present art or subsequently developed may 
be substituted as appropriate and as desired by a 
practitioner . 
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Trans-switching to endogenous nonhuman heavy chain 
constant region genes can occur and produce chimeric heavy 
chains and antibodies comprising such chimeric human/mouse 
heavy chains. Such chimeric antibodies may be desired for 
5 certain uses described herein or may be undesirable. 

G. Functional Disruption of 

Endogenous Immunoglobulin Loci 

The expression of successfully rearranged 

10 immunoglobulin heavy and light transgenes is expected to have 
a dominant effect by suppressing the rearrangement of the 
endogenous immunoglobulin genes in the transgenic nonhuman 
animal. However, another way to generate a nonhuman that is 
devoid of endogenous antibodies is by mutating the endogenous 

15 immunoglobulin loci. Using embryonic stem cell technology and 
homologous recombination, the endogenous immunoglobulin 
repertoire can be readily eliminated. The following describes 
the functional description of the mouse immunoglobulin loci. 
The vectors and methods disclosed, however, can be readily 

20 adapted for use in other non-human animals. 

Briefly, this technology involves the inactivation 
of a gene, by homologous recombination, in a pluripotent cell 
line that is capable of differentiating into germ cell tissue. 
A DNA construct that contains an altered, copy of a mouse 

25 immunoglobulin gene is introduced into the nuclei of embryonic 
stem cells. In a portion of the cells, the introduced DNA 
recombines with the endogenous copy of the mouse gene, 
replacing it with the altered copy. Cells containing the 
newly engineered genetic lesion are injected into a host mouse 

3 0 embryo, which is reimplanted into a recipient female. Some of 
these embryos develop into chimeric mice that possess germ 
cells entirely derived from the mutant cell line. Therefore, 
by breeding the chimeric mice it is possible to obtain a new 
line of mice containing the introduced genetic lesion 

35 (reviewed by Capecchi (1989), Science , 244 , 1288-1292). 

Because the mouse X locus contributes to only 5% of 
the immunoglobulins, inactivation of the heavy chain and/or 
*-light chain loci is sufficient. There are three ways to 
disrupt each of these loci, deletion of the J region, deletion 



of the J-C intron enhancer, and disruption of constant region 
coding sequences by the introduction of a stop codon. The 
last option is the most straightforward, in terms of DNA 
construct design. Elimination of the jti gene disrupts B-cell 
maturation thereby preventing class switching to any of the 
functional heavy chain segments. The strategy for knocking 
out these loci is outlined below. 

To disrupt the mouse and k genes, targeting 
vectors are used based on the design employed by Jaenisch and 
co-workers (Zijlstra, et al. (1989), Nature, 342, 435-438) for 
the successful disruption of the mouse ^-microglobulin gene. 
The neomycin resistance gene (neo) , from the plasmid pMCIneo 
is inserted into the coding region of the target gene. The 
pMCIneo insert uses a hybrid viral promoter/ enhancer sequence 
to drive neo expression. This promoter is active in embryonic 
stem cells. Therefore, neo can be used as a selectable marker 
for integration of the knock-out construct. The HSV thymidine 
kinase (tk) gene is added to the end of the construct as a 
negative selection marker against random insertion events 
(Zijlstra, et al. , supra . ) . 

A preferred strategy for disrupting the heavy chain 
locus is the elimination of the J region. This region is 
fairly compact in the mouse, spanning only 1.3 kb. To 
construct a gene targeting vector, a 15 kb Kpnl fragment 
containing all of the secreted A constant region exons from 
mouse genomic library is isolated. The 1.3 kb J region is 
replaced with the 1.1 kb insert from pMCIneo. The HSV tk gene 
is then added to the 5 1 end of the Kpnl fragment. Correct 
integration of this construct, via homologous recombination, 
will result in the replacement of the mouse J H region with the 
neo gene. Recombinants are screened by PCR, using a primer 
based on the neo gene and a primer homologous to mouse 
sequences 5* of the Kpnl site in the D region. 

Alternatively, the heavy-chain locus is knocked out 
by disrupting the coding region of the ju gene. This approach 
involves the same 15 kb Kpnl fragment used in the previous 
approach. The 1.1 kb insert from pMCIneo is inserted at a 
unique BamHI site in exon II, and the HSV tk gene added to the 
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3 1 Kpnl end. Double crossover events on either side of the 
neo insert, that eliminate the tk gene, are then selected for. 
These are detected from pools of selected clones by PCR 
amplification. One. of the PCR primers is derived from neo 
sequences and the other from mouse sequences outside of the 
targeting vector. The functional disruption of the mouse 
immunoglobulin loci is presented in the Examples. 

G. Suppressing Expression of 

Endogenous Immunoglobulin Loci 

In addition to functional disruption of endogenous 

Ig loci, an alternative method for preventing the expression 

of an endogenous Ig locus is suppression. Suppression of 

endogenous Ig genes may be accomplished with antisense RNA 

produced from one or more integrated transgenes, by antisense 

oligonucleotides, and/or by administration of antisera 

specific for one or more endogenous Ig chains. 

Antisense Polynucleotides 
Antisense RNA transgenes can be employed to 
partially or totally knock-out expression of specific genes 
(Pepin et al. (1991) Nature 355 : 725; Helene. , C. and Toulme, 
J. (1990) Biochimica Biophys. Acta 1049: 99; Stout, J. and 
Caskey, T. (1990) Somat. Cell Mol. Genet. 16: 369; Munir et 
al. (1990) Somat. Cell Mol. Genet. 16 : 383, each of which is 
incorporated herein by reference) . 

"Antisense polynucleotides" are polynucleotides 
that: (1) are complementary to all or part of a reference 
sequence, such as a sequence of an endogenous Ig C H or C L 
region, and (2) which specifically hybridize to a 
complementary target sequence, such as a chromosomal gene 
locus or a Ig mRNA. Such complementary antisense 
polynucleotides may include nucleotide substitutions, 
additions, deletions, or transpositions, so long as specific 
hybridization to the relevant target sequence is retained as 
functional property of the polynucleotide. Complementary 
antisense polynucleotides include soluble antisense RNA or DN 
oligonucleotides which can hybridize specifically to 



individual mRNA species and prevent transcription and/ or RNA 
processing of the mRNA species and/or translation of the 
encoded polypeptide (Ching et al. , Proc. Natl. Ac ad. Sci. 
U, s.A. 86:10006-10010 (1989); Broder et al., Ann. Int. Med . 
113 : 604-618 (1990); Loreau et al. , FEBS Letters 274 : 53-56 
(1990); Holcenberg et al., W091/11535; U.S. S.N. 07/530,165 
("New human CRIPTO gene"); WO91/09865; WO91/04753; WO90/13641; 
and EP 386563, each of which is incorporated herein by 
reference) . An antisense sequence is a polynucleotide 
sequence that is complementary to at least one immunoglobulin 
gene sequence of at least about 15 contiguous nucleotides in 
length, typically at least 2 0 to 3 0 nucleotides in length, and 
preferably more than about 3 0 nucleotides in length. However, 
in some embodiments, antisense sequences may have 
substitutions, additions, or deletions as compared to the 
complementary immunoglobulin gene sequence, so long as 
specific hybridization is retained as a property of the 
antisense polynucleotide. Generally, an antisense sequence is 
complementary to an endogenous immunoglobulin gene sequence 
that encodes, or has the potential to encode after DNA 
rearrangement, an immunoglobulin chain. In some cases, sense 
sequences corresponding to an immunoglobulin gene sequence may 
function to suppress expression, particularly by interfering 
with transcription. 

The antisense polynucleotides therefore inhibit 
production of the encoded polypeptide (s) . In this regard, 
antisense polynucleotides that inhibit transcription and/ or 
translation of one or more endogenous Ig loci can alter the 
capacity and/or specificity of a non-human animal to produce 
immunoglobulin chains encoded by endogenous Ig loci. 

Antisense polynucleotides may be produced from a 
heterologous expression cassette in a transfectant cell or 
transgenic cell, such as a transgenic pluripotent 
hematopoietic stem cell used to reconstitute all or part of 
the hematopoietic stem cell population of an individual, or a 
transgenic nonhuman animal. Alternatively, the antisense 
polynucleotides may comprise soluble oligonucleotides that are 
administered to the external milieu, either in culture medium 



in vitro or in the circulatory system or interstitial fluid in 
vivo . Soluble antisense polynucleotides present in the 
external milieu have been shown to gain access to the 
cytoplasm and inhibit translation of specific mRNA species. In 
some embodiments the antisense polynucleotides comprise 
methylphosphonate moieties, alternatively phosphorothiolates 
or O-methylribonucleotides may be used, and chimeric 
oligonucleotides may also be used (Dagle et al. (1990) Nucleic 
Acids Res. 18: 4751) . For some applications, antisense 
oligonucleotides may comprise polyamide nucleic acids (Nielsen 
et al. (1991) Science 254 : 1497). For general methods 
relating to antisense polynucleotides, see Antisen se RNA and 
DNA . (1988), D.A. Melton, Ed., Cold Spring Harbor Laboratory, 
Cold Spring Harbor, NY) . 

Antisense polynucleotides complementary to one or 
more sequences are employed to inhibit transcription, RNA 
processing, and/or translation of the cognate mRNA species and 
thereby effect a reduction in the amount of the respective 
encoded polypeptide. Such antisense polynucleotides can 
provide a therapeutic function by inhibiting the formation of 
one or more endogenous Ig chains in vivo* 

Whether as soluble antisense oligonucleotides or as 
antisense RNA transcribed from an antisense transgene, the 
antisense polynucleotides of this invention are selected so as 
to hybridize preferentially to endogenous Ig sequences at 
physiological conditions in vivo . Most typically, the 
selected antisense polynucleotides will not appreciably 
hybridize to heterologous Ig sequences encoded by a heavy or 
light chain transgene of the invention (i.e., the antisense 
oligonucleotides will not inhibit transgene Ig expression by 
more than about 2 5 to 35 percent) . 

Antiserum Suppression 
Partial or complete suppression of endogenous Ig 
chain expression can be produced by injecting mice with 
antisera against one or more endogenous Ig chains (Weiss et 
al. (1984) Proc. Natl. Acad. Sci. fU.S.A.) 81 211, which is 
incorporated herein by reference) . Antisera are selected so 
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as to react specifically with one or iriore endogenous (e.g., 
murine) Ig chains but to have minimal or no cross-reactivity 
with heterologous Ig chains encoded by an Ig transgene of the 
invention. Thus, administration of selected antisera 
according to a schedule as typified by that of Weiss et al. 
op. cit . will suppress endogenous Ig chain expression but 
permits expression of heterologous Ig chain (s) encoded by a 
transgene of the present invention. Suitable antibody sources 
for antibody comprise: 

(1) monoclonal antibodies, such as a monoclonal 
antibody that specifically binds to a murine /x, y, k, or X 
chains but does not react with the human immunoglobulin 
chain (s) encoded by a human Ig transgene of the invention; 

(2) mixtures of such monoclonal antibodies, so that 
the mixture binds with multiple epitopes on a single species 
of endogenous Ig chain, with multiple endogenous Ig chains 
(e.g., murine /i and murine y, or with multiple epitopes and 
multiple chains or endogenous immunoglobulins; 

(3) polyclonal antiserum or mixtures thereof, 
typically such antiserum/antisera is monospecific for binding 
to a single species of endogenous Ig chain (e.g., murine fx , 
murine y, murine k, murine X) or to multiple species of 
endogenous Ig chain, and most preferably such antisera 
possesses negligible binding to human immunoglobulin chains 
encoded by a transgene of the invention; and/ or 

(4) a mixture of polyclonal antiserum and monoclonal 
antibodies binding to a single or multiple species of 
endogenous Ig chain, and most preferably possessing negligible 
binding to human immunoglobulin chains encoded by a transgene 
of the invention. Generally, polyclonal antibodies are 
preferred, and such substantially monospecific polyclonal 
antibodies can be advantageously produced from an antiserum 
raised against human immunoglobulin (s) by pre-adsorption with 
antibodies derived from the nonhuman animal species (e.g., 
murine) and/or, for example, by affinity chromatography of the 
antiserum or purified fraction thereof on an affinity resin 
containing immobilized human Ig (wherein the bound fraction is 
enriched for the desired anti-human Ig in the antiserum; the 
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bound fraction is typically eluted with conditions of low pH 
or a chaotropic salt solution) . 

Cell separation and/or complement fixation can be 
employed to provide the enhancement of antibody-directed cell 
5 depletion of lymphocytes expressing endogenous (e.g., murine) 
immunoglobulin chains. In one embodiment, for example, 
antibodies are employed for ex vivo depletion of murine Ig- 
expressing explanted hematopoietic cells and/or B-lineage 
lymphocytes obtained from a transgenic mouse harboring a human 

10 Ig transgene. Thus, hematopoietic cells and/or B-lineage 
lymphocytes are explanted from a transgenic nonhuman animal 
harboring a human Ig transgene (preferably harboring both a 
human heavy chain transgene and a human light chain transgene) 
and the explanted cells are incubated with an antibody (or 

15 antibodies) which (1) binds to an endogenous immunoglobulin 

(e.g., murine /x and/or k) and (2) lacks substantial binding to 
human immunoglobulin chains encoded by the transgene (s) . Such 
antibodies are referred to as "suppression antibodies" for 
clarity. The explanted cell population is selectively 

20 depleted of cells which bind to the suppression antibody ( ies) ; 
such depletion can be accomplished by various methods, such as 
(1) physical separation to remove suppression antibody-bound 
cells from unbound cells (e.g., the suppression antibodies may 
be bound to a solid support or magnetic bead to immobilize and 

25 remove cells binding to the suppression antibody) , (2) 
antibody-dependent cell killing of cells bound by the 
suppression antibody (e.g., by ADCC, by complement fixation, 
or by a toxin linked to the suppression antibody) , and (3) 
clonal anergy induced by the suppression antibody, and the 

3 0 like. 

Frequently, antibodies used for antibody suppression 
of endogenous Ig chain production will be capable of fixing 
complement. It is frequently preferable that such antibodies 
may be selected so as to react well with a convenient 
35 complement source for ex vivo / in vitro depletion, such as 

rabbit or guinea pig complement. For in vivo depletion, it is 
generally preferred that the suppressor antibodies possess 
effector functions in the nonhuman transgenic animal species; 
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thus, a suppression antibody comprising murine effector 
functions (e.g., ADCC and complement fixation) generally would 
be preferred for use in transgenic mice. 

In one variation, a suppression antibody that 
5 specifically binds to a predetermined endogenous 

immunoglobulin chain is used for ex vivo /in vitro depletion 
of lymphocytes expressing an endogenous immunoglobulin. A 
cellular explant (e.g., lymphocyte sample) from a transgenic 
nonhuman animal harboring a human immunoglobulin transgene is 

10 contacted with a suppression antibody and cells specifically 
binding to the suppression antibody are depleted (e.g., by 
immobilization, complement fixation, and the like) , thus 
generating a cell subpopulation depleted in cells expressing 
endogenous ( nonhuman ) immunoglobulins (e.g. , lymphocytes 

15 expressing murine Ig) . The resultant depleted lymphocyte 

population (T cells, human Ig-positive B-cells, etc.) can be 
transferred into a immunocompatible (i.e., MHC-compatible) 
nonhuman animal of the same species and which is substantially 
incapable of producing endogenous antibody (e.g., SCID mice, 

2 0 RAG-1 or RAG- 2 knockout mice) . The reconstituted animal 

(mouse) can then be immunized with an antigen (or reimmunized 
with an antigen used to immunize the donor animal from which 
the explant was obtained) to obtain high-affinity (affinity 
matured) antibodies and B-cells producing such antibodies. 
25 Such B-cells may be used to generate hybridomas by 

conventional cell fusion and screened. Antibody suppression 
can be used in combination with other endogenous Ig 
inactivation/suppression methods (e.g., J H knockout, C H 
knockout, D-region ablation, antisense suppression, 

3 0 compensated frameshift inactivation) . 

Complete Endogenous Ig Locus Inactivation 
In certain embodiments, it is desirable to effect 
complete inactivation of the endogenous Ig loci so that hybrid 
3 5 immunoglobulin chains comprising a human variable region and a 
non-human (e.g., murine) constant region cannot be formed 
(e.g., by trans-switching between the transgene and endogenous 
Ig sequences) . Knockout mice bearing endogenous heavy chain 
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alleles with are functionally disrupted in the J H region only 
frequently exhibit trans-switching, typically wherein a 
rearranged human variable region (VDJ) encoded by a transgene 
is expressed as a fusion protein linked to an endogenous 
5 murine constant region, although other trans-switched 

junctions are possible. To overcome this potential problem, 
it is generally desirable to completely inactivate the 
endogenous heavy chain locus by any of various methods, 
including but not limited to the following: (1) functionally 

10 disrupting and/ or deleting by homologous recombination at 
least one and preferably all of the endogenous heavy chain 
constant region genes, (2) mutating at least one and 
preferably all of the endogenous heavy chain constant region 
genes to encode a termination codon (or frameshift) to produce 

15 a truncated or frameshifted product (if trans-switched) , and 
other methods and strategies apparent to those of skill in the 
art. Deletion of a substantial portion or all of the heavy 
chain constant region genes and/ or D-region genes may be 
accomplished by various methods, including sequential deletion 

2 0 by homologous recombination targeting vectors, especially of 
the "hit-and-run" type and the like. Similarly, functional 
disruption and/ or deletion of at least one endogenous light 
chain locus (e.g., k) to ablate endogenous light chain 
constant region genes is often preferable. 

25 Frequently, it is desirable to employ a frameshifted 

transgene wherein the heterologous transgene comprises a 
frameshift in the J segment (s) and a compensating frameshift 
(i.e., to regenerate the original reading frame) in the 
initial region (i.e., amino-terminal coding portion) of one or 

30 more (preferably all) of the transgene constant region genes. 
Trans-switching to an endogenous IgH locus constant gene 
(which does not comprise a compensating frameshift) will 
result in a truncated or missense product that results in the 
trans-switched B cell being deleted or non-selected, thus 

35 suppressing the trans-switched phenotype. 

Antisense suppression and antibody suppression may 
also be used to effect a substantially complete functional 
inactivation of endogenous Ig gene product expression (e.g., 



murine heavy and light chain sequences) and/or trans-switched 
antibodies (e.g., human variable/murine constant chimeric 

antibodies) . 

Various combinations of the inactivation and 
suppression strategies may be used to effect essentially total 
suppression of endogenous (e.g., murine) Ig chain expression. 

Trans-Switching 
In some variations, it may be desirable to produce a 
trans-switched immunoglobulin. For example, such trans- 
switched heavy chains can be chimeric (i.e., a non-murine 
(human) variable region and a murine constant region) . 
Antibodies comprising such chimeric trans-switched 
immunoglobulins can be used for a variety of applications 
where it is desirable to have a non-human (e.g., murine) 
constant region (e.g., for retention of effector functions in 
the host, for the presence of murine immunological 
determinants such as for binding of a secondary antibody which 
does not bind human constant regions) . For one example, a 
human variable region repertoire may possess advantages as 
compared to the murine variable region repertoire with respect 
to certain antigens. Presumably the human V H , D, J H , V L , and 
J L genes have been selected for during evolution for their 
ability to encode immunoglobulins that bind certain 
evolutionarily important antigens; antigens which provided 
evolutionary selective pressure for the murine repertoire can 
be distinct from those antigens which provided evolutionary 
pressure to shape the human repertoire. Other repertoire 
adavantages may exist, making the human variable region 
repertoire advantageous when combined with a murine constant 
region (e.g., a trans-switched murine) isotype. The presence 
of a murine constant region can afford advantages over a human 
constant region. For example, a murine 7 constant region 
linked to a human variable region by trans-switching may 
provide an antibody which possesses murine effector functions 
(e.g., ADCC, murine complement fixation) so that such a 
chimeric antibody (preferably monoclonal) which is reactive 
with a predetermined antigen (e.g., human IL-2 receptor) may 
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be tested in a mouse disease model, such as a mouse model of 
graft-versus-host disease wherein the T lymphocytes in the 
mouse express a functional human IL-2 receptor. Subsequently, 
the human variable region encoding sequence may be isolated 
5 (e.g., by PCR amplification or cDNA cloning from the source 
(hybridoma clone) ) and spliced to a sequence encoding a 
desired human constant region to encode a human sequence 
antibody more suitable for human therapeutic uses where 
immunogenic ity is preferably minimized. The polynucleotide (s) 

10 having the resultant fully human encoding sequence (s) can be 

expressed in a host cell (e.g., from an expression vector in a 
mammalian cell) and purified for pharmaceutical formulation. 
For some applications, the chimeric antibodies may be used 
directly without replacing the murine constant region with a 

15 human constant region. Other variations and uses of trans- 
switched chimeric antibodies will be evident to those of skill 
in the art. 

The present invention provides transgenic nonhuman 
animals containing B lymphocytes which express chimeric 

2 0 antibodies, generally resulting from trans-switching between a 

human heavy chain transgene and an endogenous murine heavy 
chain constant region gene. Such chimeric antibodies comprise 
a human sequence variable region and a murine constant region, 
generally a murine switched (i.e., non-ju, non-5) isotype. The 
25 transgenic nonhuman animals capable of making chimeric 
antibodies to a predetermined antigen are usually also 
competent to make fully human sequence antibodies if both 
human heavy chain and human light chain transgenes encoding 
human variable and human constant region genes are integrated. 

3 0 Most typically, the animal is homozygous for a functionally 

disrupted heavy chain locus and/ or light chain locus but 
retains one or more endogenous heavy chain constant region 
gene(s) capable of trans-switching (e.g., y,ct, e) and 
frequently retains a cis-linked enhancer. Such a mouse is 
3 5 immunized with a predetermined antigen, usually in combination 
with an adjuvant, and an immune response comprising a 
detectable amount of chimeric antibodies comprising heavy 
chains composed of human sequence variable regions linked to 



murine constant region sequences is produced. Typically, the 
serum of such an immunized animal can comprise such chimeric 
antibodies at concentrations of about at least 1 /xg/ml, often 
about at least 10 jug/ml, frequently at least 3 0 /ig/ml, and up 
to 50 to 100 fig /ml or more. The antiserum containing 
antibodies comprising chimeric human variable/mouse constant 
region heavy chains typically also comprises antibodies which 
comprise human variable /human constant region (complete human 
sequence) heavy chains. Chimeric trans-switched antibodies 
usually comprise (1) a chimeric heavy chain composed of a 
human variable region and a murine constant region (typically 
a murine gamma) and (2) a human transgene-encoded light chain 
(typically kappa) or a murine light chain (typically lambda in 
a kappa knockout background) . Such chimeric trans-switched 
antibodies generally bind to a predetermined antigen (e.g., 
the immunogen) with an affinity of about at least 1 x 10 7 M" 1 , 
preferably with an affinity of about at least 5 x 10 7 M" 1 , more 
preferably with an affinity of at least 1 x 10 s M" 1 to 1 x 10 9 
M" 1 or more. Frequently, the predetermined antigen is a human 
protein, such as for example a human cell surface antigen 
(e.g., CD4, CD8, IL-2 receptor, EGF receptor, PDGF receptor), 
other human biological macromolecule (e.g., thrombomodulin, 
protein C, carbohydrate antigen, sialyl Lewis antigen, L- 
selectin) , or nonhuman disease associated macromolecule (e.g., 
bacterial LPS, virion capsid protein or envelope glycoprotein) 

and the like. 

The invention provides transgenic nonhuman animals 
comprising a genome comprising: (1) a homozygous functionally 
disrupted endogenous heavy chain locus comprising at least one 
murine constant region gene capable of trans-switching (e.g. , 
in cis linkage to a functional switch recombination sequence 
and typically to a functional enhancer) , (2) a human heavy 
chain transgene capable of rearranging to encode end express a 
functional human heavy chain variable region and capable of 
trans-switching (e.g., having a cis-linked RSS) ; optionally 
further comprising (3) a human light chain (e.g., kappa) 
transgene capable of rearranging to encode a functional human 
light chain variable region and expressing a human sequence 
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light chain; optionally further comprising (4) a homozygous 
functionally disrupted endogenous light chain locus ( k , 
preferably k and X) ; and optionally further comprising (5) a 
serum comprising an antibody comprising a chimeric heavy chain 
composed of a human sequence variable region encoded by a 
human transgene and a murine constant region sequence encoded 
by an endogenous murine heavy chain constant region gene 
(e.g., yl, y2a, y2h, y3) . 

Such transgenic mice may further comprise a serum 
comprising chimeric antibodies which bind a predetermined 
human antigen (e.g., CD4 , CD8, CEA) with an affinity of about 
at least 1 x 10 4 M" 1 , preferably with an affinity of about at 
least 5 x 10 4 M" 1 , more preferably with an affinity of at least 
1 x 10 7 M" 1 to 1 x 10 9 M" 1 or more. Frequently, hybridomas can 
be made wherein the monoclonal antibodies produced thereby 
have an affinity of at least 8 xlO 7 M" 1 . Chimeric antibodies 
comprising a heavy chain composed of a murine constant region 
and a human variable region, often capable of binding to a 
nonhuman antigen, may also be present in the serum or as an 
antibody secreted from a hybridoma. 

In some variations, it is desirable to generate 
transgenic mice which have inactivated endogenous mouse heavy 
chain loci which retain intact heavy chain constant region 
genes, and which have a human heavy chain transgene capable of 
trans-switching, and optionally also have a human light chain 
transgene, optionally with one or more inactivated endogenous 
mouse light chain loci. Such mice may advantageously produce 
B cells capable of alternatively expressing antibodies 
comprising fully human heavy chains and antibodies comprising 
chimeric (human variable/mouse constant) heavy chains, by 
trans-switching. The serum of said mice would contain 
antibodies comprising fully human heavy chains and antibodies 
comprising chimeric (human variable/mouse constant) heavy 
chains, preferably in combination with fully human light 
chains. Hybridomas can be generated from the B cells of said 
mice. 

Generally, such chimeric antibodies can be generated 
by trans -switching, wherein a human transgene encoding a human 



variable region (encoded by productive V-D-J rearrangement in 
vivo ) and a human constant region, typically human fi, 
undergoes switch recombination with a non-transgene 
immunoglobulin constant gene switch sequence (RSS) thereby 
operably linking the transgene-encoded human variable region 
with a heavy chain constant region which is not encoded by 
said transgene, typically an endogenous murine immunoglobulin 
heavy chain constant region or a heterologous (e.g., human) 
heavy chain constant region encoded on a second transgene. 
Whereas cis-switching refers to isotype-switching by 
recombination of RSS elements within a transgene, trans- 
switching involves recombination between a transgene RSS and 
an RSS element outside the transgene, often on a different 
chromosome than the chromosome which harbors the transgene. 

Trans-switching generally occurs between an RSS of 
an expressed transgene heavy chain constant region gene and 
either an RSS of an endogenous murine constant region gene (of 
a non-ju isotype, typically 7) or an RSS of a human constant 
region gene contained on a second transgene, often integrated 
on a separate chromosome. 

When trans-switching occurs between an RSS of a 
first, expressed transgene heavy chain constant region gene 
(e.g., /x) and an RSS of a human heavy chain constant region 
gene contained on a second transgene, a non-chimeric antibody 
having a substantially fully human sequence is produced. For 
example and not limitation, a polynucleotide encoding a human 
heavy chain constant region (e.g., 7I) and an operably linked 
RSS (e.g., a 71 RSS) can be introduced (e.g., transfected) 
into a population of hybridoma cells generated from a 
transgenic mouse B-cell (or B cell population) expressing an 
antibody comprising a transgene-encoded human ji chain. The 
resultant hybridoma cells can be selected for the presence of 
the introduced polynucleotide and/or for the expression of 
trans-switched antibody comprising a heavy chain having the 
variable region ( idiotype/ antigen reactivity) of the human 
chain and having the constant region encoded by the introduced 
polynucleotide sequence (e.g., human 7I) • Trans-switch 
recombination between the RSS of the transgene-encoded human 
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chain and the RSS of the introduced polynucleotide encoding a 
downstream isotype (e.g., 7I) thereby can generate a trans- 
switched antibody. 

The invention also provides a method for producing 
5 such chimeric trans-switched antibodies comprising the step of 
immunizing with a predetermined antigen a transgenic mouse 
comprising a genome comprising: (1) a homozygous functionally 
disrupted endogenous heavy chain locus comprising at least one 
murine constant region gene capable of trans-switching (e.g., 

10 72a, 72b, 71, 73) , (2) a human heavy chain transgene capable 
of rearranging to encode a functional human heavy chain 
variable region and expressing a human sequence heavy chain 
and capable of undergoing isotype switching (and/ or trans- 
switching) , and optionally further comprising (3) a human 

15 light chain (e.g. , kappa) transgene capable of rearranging to 
encode a functional human light (e.g., kappa) chain variable 
region and expressing a human sequence light chain, and 
optionally further comprising (4) a homozygous functionally 
disrupted endogenous light chain locus (typically k , 

2 0 preferably both k and X) , and optionally further comprising 

(5) a serum comprising an antibody comprising a chimeric heavy 
chain composed of a human sequence variable region encoded by 
a human transgene and a murine constant region sequence 
encoded by an endogenous murine heavy chain constant region 
25 gene (e.g., 71, 72a, 72b, 73). 

Affinity Tagging: Selecting for Switched Isotypes 

Advantageously, trans-switching (and cis-switching) 
is associated with the process of somatic mutation. Somatic 

3 0 mutation expands the range of antibody affinities encoded by 

clonal progeny of a B-cell. For example, antibodies produced 
by hybridoma cells which have undergone switching (trans- or 
cis-) represent a broader range of antigen-binding affinities 
than is present in hybridoma cells which have not undergone 
35 switching. Thus, a hybridoma cell population (typically 

clonal) which expresses a first antibody comprising a heavy 
chain comprising a first human heavy chain variable region in 
polypeptide linkage to a first human heavy chain constant 
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region (e.g., can be screened for hybridoma cell clonal 
variants which express an antibody comprising a heavy chain 
containing said first human heavy chain variable region in 
polypeptide linkage to a second heavy chain constant region 
5 (e.g., a human 7, a, or e constant region). Such clonal 

variants can be produced by natural clonal variation producing 
cis-switching in vitro , by induction of class switching 
(trans- or cis-) as through the administration of agents that 
promote isotype switching, such as T-cell-derived lymphokines 

10 (e.g., IL-4 and IFN y)/ by introduction of a polynucleotide 
comprising a functional RSS and a heterologous (e.g. human) 
heavy chain constant region gene to serve as a substrate for 
trans-switching, or by a combination of the above, and the 
like. Often, polynucleotides containing a human downstream 

15 isotype constant region (e.g., 7I, y3 , and the like) with an 
operably linked RSS will also be introduced into hybridoma 
cells to promote isotype switching via the trans-switch 
mechanism. 

Class switching and affinity maturation take place 

2 0 within the same population of B cells derived from transgenic 

animals of the present invention. Therefore, identification 
of class-switched B cells (or hybridomas derived therefrom) 
can be used as a screening step for obtaining high affinity 
monoclonal antibodies. A variety of approaches can be 
25 employed to facilitate class switching events such as cis- 
switching ( intratransgene switching), trans-switching, or 
both. For example, a single continuous human genomic fragment 
comprising both \i and 7 constant region genes with the 
associated RSS elements and switch regulatory elements (e.g., 

3 0 sterile transcript promoter) can be used as a transgene. 

However, some portions of the desired single contiguous human 
genomic fragment can be difficult to clone efficiently, such 
as due to instability problems when replicated in a cloning 
host or the like; in particular, the region between S and 73 
35 can prove difficult to clone efficiently, especially as a 

contiguous fragment comprising the \i gene, 73 gene, a V gene, 
D gene segments, and J gene segments. 
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Also for example, a discontinuous human transgene 
(minigene) composed of a human fj. gene, human 73 gene, a human 
V gene(s) , human D gene segments, and human J gene segments, 
with one or more deletions of an intervening (intronic) or 
otherwise nonessential sequence (e.g., one or more V, D, 
and/ or J segment and/ or one or more non-ju constant region 
gene(s)). Such minigenes have several advantages as compared 
to isolating a single contiguous segment of genomic DNA 
spanning all of the essential elements for efficient 
immunoglobulin expression and switching. For example, such a 
minigene avoids the necessity of isolating large pieces of DNA 
which may contain sequences which are difficult to clone 
(e.g., unstable sequences, poison sequences, and the like). 
Moreover, miniloci comprising elements necessary for isotype 
switching (e.g., human 7 sterile transcript promoter) for 
producing cis- or trans-switching, can advantageously undergo 
somatic mutation and class switching in vivo . As many 
eukaryotic DNA sequences can prove difficult to clone, 
omitting non-essential sequences can prove advantageous. 

In a variation, hybridoma clones producing 
antibodies having high binding affinity (e.g., at least 1 x 
10 7 M" 1 , preferably at least 1 x 10 s M" 1 , more preferably at 
least 1 x 10 9 M" 1 or greater) are obtained by selecting, from a 
pool of hybridoma cells derived from B cells of transgenic 
mice harboring a human heavy chain transgene capable of 
isotype switching ( see , supra) and substantially lacking 
endogenous murine heavy chain loci capable of undergoing 
productive (in-frame) V-D-J rearrangement, hybridomas which 
express an antibody comprising a heavy chain comprising a 
human sequence heavy chain variable region in polypeptide 
linkage to a human (or mouse) non-/x heavy chain constant 
region; said antibodys are termed "switched antibodies" as 
they comprise a "switched heavy chain" which is produced as a 
consequence of cis-switching and/ or trans-switching in vivo or 
in cell culture. Hybridomas producing switched antibodies 
generally have undergone the process of somatic mutation, and 
a pool of said hybridomas will generally have a broader range 
of antigen binding affinities from which hybridoma clones 



secreting high affinity antibodies can be selected. 
Typically, hybridomas secreting a human sequence antibody 
having substantial binding affinity (greater than 1 x 10 7 M" 1 
to 1 x 10 s M" 1 ) for a predetermined antigen and wherein said 
human sequence antibody comprises human immunoglobulin 
variable region (s) can be selected by a method comprising a 
two-step process* One step is to identify and isolate 
hybridoma cells which secrete immunoglobulins which comprise a 
switched heavy chain (e.g., by binding hybridoma cells to an 
immobilized immunoglobulin which specifically binds a switched 
heavy chain and does not substantially bind to an unswitched 
isotype, e.g. , m) • The other step is to identify hybridoma 
cells which bind to the predetermined antigen with substantial 
binding affinity (e.g., by ELISA of hybridoma clone 
supernatants , FACS analysis using labeled antigen, and the 
like) . Typically, selection of hybridomas which secrete 
switched antibodies is performed prior to identifying 
hybridoma cells which bind predetermined antigen. Hybridoma 
cells which express switched antibodies that have substantial 
binding affinity for the predetermined antigen are isolated 
and cultured under suitable growth conditions known in the 
art, typically as individual selected clones. Optionally, the 
method comprises the step of culturing said selected clones 
under conditions suitable for expression of monocloanl 
antibodies; said monoclonal antibodies are collected and can 
be administered for therapeutic, prophylactic, and/ or 
diagnostic purposes. 

Often, the selected hybridoma clones can serve as a 
source of DNA or RNA for isolating immunoglobulin sequences 
which encode immunoglobulins (e.g. a variable region) that 
bind to (or confer binding to) the predetermined antigen. 
Subsequently , the human variable region encoding sequence may 
be isolated (e.g., by PCR amplification or cDNA cloning from 
the source (hybridoma clone) ) and spliced to a sequence 
encoding a desired human constant region to encode a human 
sequence antibody more suitable for human therapeutic uses 
where immunogenicity is preferably minimized. The 
polynucleotide (s) having the resultant fully human encoding 
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sequence(s) can be expressed in a host cell (e.g., from an 
expression vector in a mammalian cell) and purified for 
pharmaceutical formulation. 

Xenoenhancers 

A heterologous transgene capable of encoding a human 
immunoglobulin (e.g., a heavy chain) advantageously comprises 
a cis-linked enhancer which is not derived from the mouse 
genome, and/or which is not naturally associated in cis with 
the exons of the heterologous transgene. For example, a human 
k transgene (e.g., a k minilocus) can advantageously comprise 
a human V/c gene, a human J/c gene, a human Ck gene, and a 
xenoenhancer , typically said xenoenhancer comprises a human 
heavy chain intronic enhancer and/ or a murine heavy chain 
intronic enhancer, typically located between a J k gene and the 
Ck gene, or located downstream of the Ck gene. For example, 
the mouse heavy chain J-jU intronic enhancer (Banerji et al. 
(1983) Cell 33 ; 729) can be isolated on a 0.9 kb Xbal fragment 
of the plasmid pKVe2 ( see , infra) . The human heavy chain J-/z 
intronic enhancer (Hayday et al. (1984) Nature 307: 3 34) 
can be isolated as a 1.4 kb Mlul/Hindlll fragment (see, 
infra) . Addition of a transcriptionally active xenoenhancer 
to a transgene, such as a combined xenoenhancer consisting 
essentially of a human J-/i intronic enhancer linked in cis to 
a mouse J-/i intronic enhancer, can confer high levels of 
expression of the transgene, especially where said transgene 
encodes a light chain, such as human /c. Similarly, a rat 3* 
enhancer can be advantageously included in a minilocus 
construct capable of encoding a human heavy chain. 

Specific Preferred Embodiments 

A preferred embodiment of the invention is an animal 
containing at least one, typically 2-10, and sometimes 25-50 
or more copies of the transgene described in Example 12 (e.g., 
pHCl or pHC2) bred with an animal containing a single copy of 
a light chain transgene described in Examples 5, 6, 8, or 14 , 
and the offspring bred with the J H deleted animal described in 
Example 10. Animals are bred to homozygosity for each of 
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these three traits. Such animals have the following genotype: 
a single copy (per haploid set of chromosomes) of a human 
heavy chain unrearranged mini-locus (described in Example 12) , 
a single copy (per haploid set of chromosomes) of a rearranged 
human k light chain construct (described in Example 14) , and a 
deletion at each endogenous mouse heavy chain locus that 
removes all of the functional J H segments (described in 
Example 10) . Such animals are bred with mice that are 
homozygous for the deletion of the J H segments (Examples 10) 
to produce offspring that are homozygous for the J H deletion 
and hemizygous for the human heavy and light chain constructs. 
The resultant animals are injected with antigens and used for 
production of human monoclonal antibodies against these 
antigens . 

B cells isolated from such an animal are 
monospecific with regard to the human heavy and light chains 
because they contain only a single copy of each gene. 
Furthermore, they will be monospecific with regards to human 
or mouse heavy chains because both endogenous mouse heavy 
chain gene copies are nonfunctional by virtue of the deletion 
spanning the J H region introduced as described in Example 9 
and 12* Furthermore, a substantial fraction of the B cells 
will be monospecific with regards to the human or mouse light 
chains because expression of the single copy of the rearranged 
human k light chain gene will allelically and isotypically 
exclude the rearrangement of the endogenous mouse k and X 
chain genes in a significant fraction of B-cells. 

The transgenic mouse of the preferred embodiment 
will exhibit immunoglobulin production with a significant 
repertoire, ideally substantially similar to that of a native 
mouse. Thus, for example, in embodiments where the endogenous 
Ig genes have been inactivated, the total immunoglobulin 
levels will range from about 0.1 to 10 mg/ml of serum, 
preferably 0.5 to 5 mg/ml, ideally at least about 1.0 mg/ml. 
When a transgene capable of effecting a switch to IgG from IgM 
has been introduced into the transgenic mouse, the adult mouse 
ratio of serum IgG to IgM is preferably about 10:1. Of 
course, the IgG to IgM ratio will be much lower in the 
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immature mouse. In general, greater than about 10%, 
preferably 40 to 80% of the spleen and lymph node B cells 
express exclusively human IgG protein. 

The repertoire will ideally approximate that shown 
in a non-transgenic mouse, usually at least about 10% as high, 
preferably 2 5 to 50% or more. Generally, at least about a 
thousand different immunoglobulins (ideally IgG), preferably 
10 4 to 10 6 or more, will be produced, depending primarily on 
the number of different V, J and D regions introduced into the 
mouse genome. These immunoglobulins will typically recognize 
about one-half or more of highly antigenic proteins, 
including, but not limited to: pigeon cytochrome C, chicken 
lysozyme, pokeweed mitogen, bovine serum albumin, keyhole 
limpit hemocyanin, influenza hemagglutinin, staphylococcus 
protein A, sperm whale myoglobin, influenza neuraminidase, and 
lambda repressor protein. Some of the immunoglobulins will 
exhibit an affinity for preselected antigens of at least about 
lO 7 ^" 1 , preferably lO 8 ^" 1 to lO 9 ^" 1 or greater. 

In some embodiments, it may be preferable to 
generate mice with predetermined repertoires to limit the 
selection of V genes represented in the antibody response to a 
predetermined antigen type. A heavy chain transgene having a 
predetermined repertoire may comprise, for example, human V H 
genes which are preferentially used in antibody responses to 
the predetermined antigen type in humans. Alternatively, some 
V H genes may be excluded from a defined repertoire for various 
reasons (e.g., have a low likelihood of encoding high affinity 
V regions for the predetermined antigen; have a low propensity 
to undergo somatic mutation and affinity sharpening; or are 
immunogenic to certain humans) . 

Thus, prior to rearrangement of a transgene 
containing various heavy or light chain gene segments, such 
gene segments may be readily identified, e.g. by hybridization 
or DNA sequencing, as being from a species of organism other 
than the transgenic animal. 

The transgenic mice of the present invention can be 
immunized with a predetermined antigen, such as a 
transmembrane proteins, cell surface macromolecule , or other 
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suitable antigen (e.g., TNF, LPS, etc.) for which a human 
antibody would be desirable. The mice will produce B cells 
which undergo class-switching via intratransgene switch 
recombination (cis-switching) and express immunoglobulins 
reactive with the predetemined antigen. The immunoglobulins 
can be human sequence antibodies, wherein the heavy and light 
chain polypeptides are encoded by human transgene sequences, 
which may include sequences derived by somatic mutation and V 
region recombinatorial joints, as well as germline-encoded 
sequences; these human sequence immunoglobulins can be 
referred to as being substantially identical to a polypeptide 
se q Uence encoded by a human V L or V H gene segment and a human 
J L or J L segment, even though other non-germline sequences may 
be present as a result of somatic mutation and differential V- 
J and V-D-J recombination joints. With respect to such human 
sequence antibodies, the variable regions of each chain are 
typically at least 80 percent encoded by human germline V, J, 
and, in the case of heavy chains, D, gene segments; frequently 
at least 85 percent of the variable regions are encoded by 
human germline sequences present on the transgene; often 90 or 
95 percent or more of the variable region sequences are 
encoded by human germline sequences present on the transgene. 
However , since non-germline sequences are introduced by 
somatic mutation and VJ and VDJ joining, the human sequence 
antibodies will frequently have some variable region sequences 
(and less frequently constant region sequences) which are not 
encoded by human V, D, or J gene gegments as found in the 
human transgene (s) in the germline of the mice. Typically, 
such non-germline sequences (or individual nucleotide 
positions) will cluster in or near CDRs, or in regions where 
somatic mutations are known to cluster. 

The human sequence antibodies which bind to the 
predetermined antigen can result from isotype switching, such 
that human antibodies comprising a human sequence y chain 
(such as yl, y2a. f y2B r or 73) and a human sequence light chain 
(such as K) are produced. Such isotype-switched human 
sequence antibodies often contain one or more somatic 
mutation (s) , typically in the variable region and often in or 
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within about 10 residues of a CDR) as a result of affinity 
maturation and selection of B cells by antigen, particualarly 
subsequent to secondary (or subsequent) antigen challenge. 
These high affinity human sequence antibodies may have 
binding affinities of at least 1 x 10 9 M" 1 , typically at least 
5 x lO 9 *!" 1 , frequently more than 1 x 10 10 M" 1 , and sometimes 5 
x lO^M" 1 to 1 x 10" 11 or greater. Such high affinity human 
sequence antibodies can be made with high binding affinities 
for human antigens, such as human CD4 and the like human 
macromolecules (e.g., such as a human transmembrane or cell 
surface protein or other cell surface antigen) . 

The B cells from such mice can be used to generate 
hybridomas expressing monoclonal high affinity (greater than 2 
x 10 9 M" 1 ) human sequence antibodies against a variety of 
antigens, including human proteins such as CD 4 and the like. 
These hybridomas can be used to generate a composition 
comprising an immunoglobulin having an affinity constant (K a ) 
of at least 2 x 10 9 M" 1 for binding to a predetermined human 
antigen, wherein said immunoglobulin consists of: 

a human sequence light chain composed of (1) a light 
chain variable region having a polypeptide sequene which is 
substantially identical to a polypeptide sequence encoded by a 
human V L gene segment and a human J L segment, and (2) a light 
chain constant region having a polypeptide sequence which is 
substantially identical to a polypeptide sequence encoded by a 
human C L gene segment; and 

a human sequence heavy chain composed of a (1) a 
heavy chain variable region having a polypeptide sequene which 
is substantially identical to a polypeptide sequence encoded 
by a human V H gene segment, optionally a D region, and a human 
J H segment, and (2) a constant region having a polypeptide 
sequence which is substantially identical to a polypeptide 
sequence encoded by a human C H gene segment. 

Often, the human sequence heavy chain and human 
sequence light chain are separately encoded by a human heavy 
chain transgene and a human light chain transgene, 
respectively, which are integrated into a mouse cell genome. 
However, both chains may be encoded on a single transgene, or 
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one or both chains may be encoded on multiple transgenes, such 
as a human heavy chain transgene (e.g., HC2) which derived a V 
gene segment from a YAC containing a V H array which is not 
integrated ar the same locus as the human heavy chain 
transgene in the mouse germline. 

In one embodiment, the composition has an 
immunoglobulin which comprises a human sequence light chain 
having a k constant region and a human sequence heavy chain 
having a 7 constant region. 

The mice (and hybridomas derived therefrom) are a 
source for an immunoglobulin having an affinity constant (K a ) 
of at least 1 xlO 10 M" 1 for binding to a predetermined human 
antigen, wherein said immunoglobulin consists of: 

a human sequence light chain composed of (1) a light 
chain variable region having a polypeptide sequene which is 
substantially identical to a polypeptide sequence encoded by a 
human V L gene segment and a human J L segment, and (2) a light 
chain constant region having a polypeptide sequence which is 
substantially identical to a polypeptide sequence encoded by a 
human C L gene segment; and 

a human sequence heavy chain composed of a (1) a 
heavy chain variable region having a polypeptide sequene which 
is substantially identical to a polypeptide sequence encoded 
by a human V H gene segment, optionally a D region, and a human 
J H segment, and (2) a constant region having a polypeptide 
sequence which is substantially identical to a polypeptide 
sequence encoded by a human C H gene segment. 

The invention provides a transgenic mouse 
comprising: a homozygous pair of functionally disrupted 
endogenous heavy chain alleles, a homozygous pair of 
functionally disrupted endogenous light chain alleles, at 
least one copy of a heterologous immunoglobulin light chain 
transgene, and at least one copy of a heterologous 
immunoglobulin heavy chain transgene, and wherein said animal 
makes an antibody response following immunization with a human 
antigen wherein the antibody response comprises an 
immunoglobulin having an affinity constant (K a ) of at least 2 
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x 10 9 M" 1 for binding to a predetermined human antigen, wherein 
said immunoglobulin consists of : 

a human sequence light chain composed of (1) a light 
chain variable region having a polypeptide sequene which is 
substantially identical to a polypeptide sequence encoded by a 
human V L gene segment and a human J L segment, and (2) a light 
chain constant region having a polypeptide sequence which is 
substantially identical to a polypeptide sequence encoded by a 
human C L gene segment; and 

a human sequence heavy chain composed of a (1) a 
heavy chain variable region having a polypeptide sequene which 
is substantially identical to a polypeptide sequence encoded 
by a human V H gene segment, optionally a D region, and a human 
J H segment, and (2) a constant region having a polypeptide 
sequence which is substantially identical to a polypeptide 
sequence encoded by a human C H gene segment. 

Such a transgenic mouse can produce a human sequence 
immunoglobulin which binds to a human surface or transmembrane 
protein present on at least one somatic cell type of a human, 
wherein the immunoglobulin binds said human surface or 
transmembrane protein with an affinity constant (K a ) of 
between 1.5 x 10 9 M" 1 and 1.8 x 10 10 M" 1 . One example of such 
a human surface or transmemebrane protein is CD4, although 
others may be used as immunogens as desired. 

The development of high affinity human sequence 
antibodies against predetermined antigens is facilitated by a 
method for expanding the repertoire of human variable region 
gene segments in a transgenic mouse having a genome comprising 
an integrated human immunoglobulin transgene, said method 
comprising introducing into the genome a V gene transgene 
comprising V region gene segments which are not present in 
said integrated human immunoglobulin transgene. Often, the V 
region transgene is a yeast artificial chromosome comprising a 
portion of a human V H or V L (V K ) gene segment array, as may 
naturally occur in a human genome or as may be spliced 
together separately by recombinant methods, which may include 
out-of-order or omitted V gene segments. Often at least five 
or more functional V gene segments are contianed on the YAC. 
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In this variation, it is possible to make a transgenic mouse 
produced by the V repertoire expansion method, wherein the 
mouse expresses an immunoglobulin chain comprising a variable 
region sequence encoded by a V region gene segment present on 
the V region transgene and a C region encoded on the human Ig 
transgene. By means of the V repertoire expansion method, 
transgenic mice having at least 5 distinct V genes can be 
generated; as can mice containing at least about 24 V genes or 
more. Of course, some V gene segments may be non-functional 
(e.g., pseudogenes and the like); these segments may be 
retained or may be selectively deleted by recombinant methods 
avaialble to the skilled artisan, if desired. 

Once the mouse germline has been engineered to 
contain a functional YAC having an expanded V segment 
repertoire, substantially not present in the human Ig 
transgene containing the J and C gene segments, the trait can 
be propagated and bred into other genetic backgrounds, 
including backgrounds where the functional YAC having an 
expanded V segment repertoire is bred into a mouse germline 
having a different human Ig transgene. Multiple functional 
YACs having an expanded V segment repertoire may be bred into 
a germline to work with a human Ig transgene (or multiple 
human Ig transgenes) . Although referred to herein as YAC 
transgenes, such transgenes when integrated into the genome 
may substantially lack yeast sequences, such as sequences 
required for autonomous replication in yeast; such sequences 
may optionally be removed by genetic engineering (e.g., 
restriction digestion and pulsed-field gel electrophoresis or 
other suitable method) after replication in yeast in no longer 
necessary (i.e., prior to introduction into a mouse ES cell or 
mouse prozygote) . 

The invention also provides a method of propagating 
the trait of human sequence immunoglobulin expression, 
comprising breeding a transgenic mouse having the human Ig 
transgene(s) , and optionally also having a functional YAC 
having an expanded V segment repertoire. Both V H and V L gene 
segemnts may be present on the YAC. The transgenic mouse may 
be bred into any background desired by the practitioner, 
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including backgrounds harboring other human transgenes, 
including human Ig transgenes and/ or transgenes encoding other 
human lymphocyte proteins. 

The invention also provides a high affinity human 
sequence immunoglobulin produced by a transgenic mouse having 
an expanded V region repertoire YAC transgene. 

Although the foregoing describes a preferred 
embodiment of the transgenic animal of the invention, other 
embodiments are defined by the disclosure herein and more 
particularly by the transgenes described in the Examples. 
Four categories of transgenic animal may be defined: 

I. Transgenic animals containing an unrearranged heavy 
and rearranged light immunoglobulin transgene. 

II. Transgenic animals containing an unrearranged heavy 
and unrearranged light immunoglobulin transgene 

III. Transgenic animal containing rearranged heavy and an 
unrearranged light immunoglobulin transgene, and 

IV. Transgenic animals containing rearranged heavy and 
rearranged light immunoglobulin transgenes. 

Of these categories of transgenic animal, the 
preferred order of preference is as follows II > I > III > IV 
where the endogenous light chain genes (or at least the k 
gene) have been knocked out by homologous recombination (or 
other method) and I > II > III >IV where the endogenous light 
chain genes have not been knocked out and must be dominated by 
allelic exclusion. 

As is discussed supra, the invention provides human 
sequence monoclonal antibodies that are useful in treatment of 
human diseases. Therapeutic uses of monoclonal antibodies are 
discussed in, e.g., Larrick and Bourla, Journal of Biological 
Response Modifiers, 5:379-393, which is incorporated herein by 
reference. Uses of human monoclonal antibodies include 
treatment of autoimmune diseases, cancer, infectious diseases, 
transplant rejection, blood disorders such as coagulation 
disorders, and other diseases. 

The antibodies of this invention may be administered 
to patients by any method known in the medical arts for 
delivery of proteins. Antibodies are particularly suited for 
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parenteral administration (i.e, subcutaneous, intramuscular or 
intravenous administration) . The pharmaceutical compositions 
of the present invention are suitable for administration using 
alternative drug delivery approaches as well (see, e.g., 
Langer, Science, 249:1527-1533 (1990)). 

Pharmaceutical compositions for parenteral 
administration usually comprise a solution of a monoclonal 
antibody dissolved in an acceptable carrier, preferably an 
aqueous carrier. A variety of aqueous carriers can be used, 
e.g., water, buffered water, 0.4% saline, 0.3% glycine and the 
like. These solutions are sterile and generally free of 
particulate matter. These compositions may be sterilized by 
conventional, well known sterilization techniques. The 
compositions may contain pharmaceutically acceptable auxiliary 
substances as required to approximate physiological conditions 
such as pH-adjusting and buffering agents, tonicity adjusting 
agents and the like, for example sodium acetate, sodium 
chloride, potassium chloride, calcium chloride, sodium 
lactate, etc. The concentration of antibody in these 
f ormulations can vary widely, i.e., from less than about 0.5%, 
usually at or at least about 0.1% to as much as 1.5% or 2.0% 
by weight and will be selected primarily based on fluid 
volumes, viscosities, etc., in accordance with the particular 
mode of administration selected. Actual methods for preparing 
parenterally administrable compositions will be known or 
apparent to those skilled in the art and are described in more 
detail in, for example, Remington's Pharmaceutical Sciences, 
17th Ed. , Mack Publishing Company, Easton, Pennsylvania 
(1985) , which is incorporated herein by reference. 

The compositions containing the present antibodies 
or a cocktail thereof can be administered for the prophylactic 
and/or therapeutic treatments. In therapeutic application, 
compositions are . administered to a patient in an amount 
sufficient to cure or at least partially arrest the infection 
and its complications. An amount adequate to accomplish this 
is defined as a "therapeutically effective dose." Amounts 
effective for this use generally range from about .05 mg/kg 
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body weight to about 5 mg/kg body weight, preferably between 
about .2 mg/kg body weight to about 1.5 mg/kg body weight. 

In some instances it will be desirable to modify the 
immunoglobulin molecules of the invention to change their 
5 biological activity. For example, the immunoglobulins can be 
directly or indirectly coupled to other chemotherapeutics 
agent. A variety of chemotherapeutics can be coupled for 
targeting. For example, ant i- inflammatory agents which may be 
coupled include immunomodulators , platelet activating factor 

10 (PAF) antagonists , cyclooxygenase inhibitors , lipoxygenase 
inhibitors, and leukotriene antagonists. Some preferred 
moieties include cyclosporin A, indomethacin, naproxen, FK- 
506, mycophenolic acid, and the like. Similarly, anti- 
oxidants, e.g., superoxide dismutase, are useful in treating 

15 reperfusion injury. Likewise, anticancer agents, such as 

daunomycin, doxorubicin, vinblastine, bleomycin, and the like 
can be targeted. 

The monoclonal antibodies of the invention may also 
be used to target amphipaths (e.g., liposomes) to sites in a 

2 0 patient. In these preparations, the drug to be delivered is 

incorporated as part of a liposome in which a human monoclonal 
ant ibody i s embedded . 

The human-sequence monoclonal antibodies of the 
invention are useful, in part, because they bind specifically 

2 5 to the predetermined antigen against which they are directed. 

When the predetermined antigen is a human antigen (i.e., a 
human protein or fragment thereof) , it will sometimes be 
advantageous if the human immunoglobulin of the invention also 
binds to the cognate antigen found in non-human animals, 

3 0 especially animals that are used frequently for drug testing 

(e.g., preclinical testing of biological activity, 
pharmacokinetics and safety) . These animals include mice, 
rabbits, rats, dogs, pigs, and, especially, non-human primates 
such as chimpanzees, apes and monkeys (e.g., Rhesus monkeys 
3 5 and cynomolgus monkeys) . The ability to recognize antigens in 
experimental animals is particularly useful for determining 
the effect of specific binding on biodistribution of the 
immunoglobulins. A cognate antigen is an antigen that (i) has 
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a structure (e.g., amino acid sequence) that is substantially 
similar to the human antigen (i.e., the amino acid sequence of 
an animal cognate protein will typically be at least about 50% 
identical to the human protein, usually at least about 70% 
identical and often at least about 80% identical or more) ; 
(ii) has substantially the same function as the human antigen; 
and, (iii) often is found in the same cellular compartment as 
the human antigen. Human and animal cognate antigens typically 
(but not always) have the same names. Examples of cognate 
antigens include human tubulin and mouse tubulin, human CD4 
and Rhesus CD4, and human IgG and Rat IgG. 

An other aspect, the invention provides antigen- 
binding human mABs comprising at least one polypeptide encoded 
by an artificial gene. An artificial gene comprises a 
polypeptide-encoding nucleic acid segment that is synthesized 
in vitro by chemical or enzymatic methods that do not require 
a cell-derived template nucleic acid strand (e.g., a nucleic 
acid template obtained from a bacterial cell or an immune or 
hybridoma cell) and the progeny (through replication) of the 
artificial gene, i.e., a wholly synthetic nucleic acid. 

Although it is routine in genetic engineering to use 
short synthetic nucleic acids as primers, linkers and the 
like, it is also possible by chemical and/or enzymatic means 
to produce wholly synthetic protein-coding nucleic acids that 
are 30, 50, or more bases in length. The artificial genes of 
the invention may include both synthetic nucleic acid regions 
and cell-derived nucleic acid regions. The synthetic nucleic 
acid region of the artificial gene will generally be at least 
about 50 bases in length, often at least about 100 bases, 
typically at least about 200 bases, more often at least about 
250 bases and usually over 3 00 bases or 400 bases in length. 
Typically the synthetic nucleic acid regions will encode 
variable gene segments or a portion thereof, e.g., CDR 
regions, and the constant regions will be encoded by cell- 
derived nucleic acids. Immunoglobulin polypeptides (i.e., 
immunoglobulin heavy chains and immunoglobulin light chains) 
can be conveniently expressed using artificial genes that 
encode the polypeptides. Usually the artificial genes are 
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operably linked to transcription promoter sequences, e.g., 
promoter sequences derived from immunoglobulin genes or from 
viruses (e.g., SV40, CMV, HIV, RSV) or hybrid promoters. The 
artificial gene may be linked to other sequences as well, e.g. 
5 polyadenylation sequences and introns. One method for 

expressing an immunoglobulin polypeptide involves insertion of 
a synthetic nucleic acid encoding one region of an 
immunoglobulin polypeptide (e.g., a variable region or portion 
thereof) into a vector that encodes the remaining segments or 

10 parts of the immunoglobulin chain (e.g., a M/ 7/ 7 2 / 73/ 7 4 / 
6, e, a x or a 2 constant region) and, optionally, promoter 
(e.g., a CMV (cytomegalovirus) promoter), polyadenylation or 
other sequences. Such vectors are constructed so that upon 
introduction into a cell, the cellular transcription and 

15 translation of the vector sequences results in an immunoglobin 
polypeptide . 

Functional human sequence immunoglobulin heavy and 
light chain genes and polypeptides can be constructed using 
artificial genes, and used to produce immunoglobulins with a 

2 0 desried specificity such as specific binding to a 

predetermined antigen. This is accomplished by constructing 
an artificial gene that encodes an immunoglobulin polypeptide 
substantially similar to a polypeptide expressed by a cell 
from, or a hybridoma derived from, a transgenic animal 
25 immunized with the predetermined antigen. Thus, the invention 
provides artificial genes encoding immunoglobulin polypeptides 
and methods for producing a human-sequence immunoglobulin 
using an artificial gene(s) . 

According to this method, a transgenic animal (e.g., 

3 0 a transgenic mouse with a homozygous pair of functionally 

disrupted endogenous heavy chain alleles, a homozygous pair of 
functionally disrupted endogenous light chain alleles, at 
least one copy of a human immunoglobulin light chain 
transgene, and at least one copy of a human immunoglobulin 
3 5 heavy chain transgene) is immunized with predetermined 

antigen, e.g., a human protein. Nucleic acid, preferably 
mRNA, is then collected or isolated from a cell or population 
of cells in which immunoglobulin gene rearrangement has taken 
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place, and the sequence (s) of nucleic acids encoding the heavy 
and/or light chains (especially the V segments) of 
immunoglobulins, or a portion thereof, is determined. This 
sequence information is used as a basis for the sequence of 

5 the artificial gene. 

Sequence determination will generally require 
isolation of at least a portion of the gene or cDNA of 
interest, e.g., a portion of a rearranged human transgene or 
corresponding cDNA encoding an immunoglobulin polypeptide. 
10 Usually this requires cloning the DNA or, preferably, mRNA 
(i.e., cDNA) encoding the human immunoglobulin polypeptide. 
Cloning is carried out using standard techniques (see, e.g., 
Sambrook et al . (1989) Molecular Cloning: A Laboratory Guide, 
Vols 1-3, Cold Spring Harbor Press, which is incorporated 

15 herein by reference) . For example, a cDNA library may be 
constructed by reverse transcription of polyA+ mRNA, 
preferably membrane-associated mRNA, and the library screened 
using probes specific for human immunoglobulin polypeptide 
gene sequences. In a preferred embodiment, however, the 

20 polymerase chain reaction (PCR) is used to amplify cDNAs (or 

portions of full-lenght cDNAs) encoding an immunoglobulin gene 
segment of interest (e.g., a light chain variable segment). 
Because the sequences of the human immunoglobulin polypeptide 
genes are readily available to those of skill, probes or PCR 

25 primers that will specifically hybridize to or amplify a human 
immunoglobulin gene or segment thereof can be easily designed. 
See, e.g., Taylor et al . , Nuc . Acids. Res., 20:6287 (1992) 
which is incorporated by reference. Moreover, the sequences 
of the human transgene of the transgenic mouse will often be 

3 0 known to the practicioner , and primer sequences can be chosen 
that hybridize to appropriate regions of the transgene. The 
amplified sequences can be readily cloned into any suitable 
vector, e.g., expression vectors, minigene vectors, or phage 
display vectors. It will be appreciated that the particular 

35 method of cloning used not critical, so long as it is possible 
to determine the sequence of some portion of the 
immunoglobulin polypeptide of interest. As used herein, a 
nucleic acid that is cloned, amplified, tagged, or otherwise 
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distinguished from background nucleic acids such that the 
seqence of the nucleic acid of interest can be determined, is 
considered isolated. 

One source for RNA used for cloning and sequencing 
5 is a hybridoma produced by obtaining a B cell from the 

transgenic mouse and fusing the B cell to an immortal cell. 
An advantage of using hybridomas is that they can be easily 
screened, and a hybridoma that produces a human monoclonal 
antibody of interest selected. Alternatively, RNA can be 

10 isolated from B cells (or whole spleen) of the immunized 

animal. When sources other than hybridomas are used, it may 
be desirable to screen for sequences encoding immunoglobulins 
or immunoglobulin polypeptides with specific binding 
characteristics. One method for such screening is the use of 

15 phage display technology. Phage display is described in e.g., 
Dower et al., WO 91/17271, McCafferty et al. , WO 92/01047, and 
Caton and Koprowski, Proc. Natl. Acad. Sci . USA, 87:6450-6454 
(1990) , each of which is incorporated herein by reference. In 
one embodiment using phage display technology, cDNA from an 

20 immunized transgenic mouse (e.g., total spleen cDNA) is 

isolated, the polymerase chain reaction is used to amplify a 
cDNA sequences that encode a portion of an immunoglobulin 
polypeptide, e.g., CDR regions, and the amplified sequences 
are inserted into a phage vector. cDNAs encoding peptides of 

25 interest, e.g., variable region peptides with desired binding 
characteristics, are identified by standard techniques such as 
panning. 

The sequence of the amplified or cloned nucleic acid 
is then determined. Typically the sequence encoding an entire 

3 0 variable region of the immunoglobulin polypeptide is 

determined, however, it will sometimes by adequate to sequence 
only a portion of a variable region, for example, the CDR— 
encoding portion. Typically the portion sequenced will be at 
least 3 0 bases in length, more often based coding for at least 

35 about one-third or aty least about one-half of the length of 
the variable region will be sequenced. 

Sequencing can be carried on clones isolated from a 
cDNA library, or, when PCR is used, after subcloning the 



amplified sequence or by direct PCR sequencing of the 
amplified segment. Sequencing is carried out using standard 
techniques (see, e.g., Sambrook et al . (1989) Molecular 
Cloning: A Laboratory Guide, Vols 1-3, Cold Spring Harbor 
Press, and Sanger, F. et al. (1977) Proc. Natl. Acad. Sci. USA 
74: 5463-5467, which is incorporated herein by reference). By 
comparing the sequence of the cloned nucleic acid with 
published sequences of human immunoglobulin genes and cDNAs, 
one of skill will readily be able to determine, depending on 
the region sequenced, (i) the germline segment usage of the 
hybridoma immunoglobulin polypeptide (including the isotype of 
the heavy chain) and (ii) the sequence of the heavy and light 
chain variable regions, including sequences resulting from N- 
region addition and the process of somatic mutation. One 
source of immunoglobulin gene sequence information is the 
National Center for Biotechnology Information, National 
Library of Medicine, National Institutes of Health, Bethesda, 
Md. 

In an alternative embodiment, the amino acid 
sequence of an immunoglobulin of interest may be determined by 
direct protein sequencing. 

An artificial gene can be constructed that has a 
sequence identical to or substantially similar to, at least a 
portion of the immunoglobulin-expressing gene (i.e., 
rearranged transgene) . Similarly, the artificial gene can 
encode an polypeptide that is identical or has substantial 
similarity to a polypeptide encoded by the sequenced portion 
of the rearranged transgene. The degeneracy of the genetic 
code allows the same polypeptide to be encoded by multiple 
nucleic acid sequences. It is sometimes desirable to change 
the nucleic acid sequence, for example to introduce 
restriction sites, change codon usage to reflect a particular 
expression system, or to remove a glycosylation site. In 
addition, changes in the hybridoma sequences may be introduced 
to change the characteristics (e.g., binding characteristics) 
of the immunoglobulin. For example, changes may be 
introduced, especially in the CDR regions of the heavy and 
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light chain variable regions, to increase the affinity of the 
immunoglobulin for the predetermined antigen. 

Methods for constructing an synthetic nucleic acids 
are well known* An entirely chemical synthesis is possible 
but in general, a mixed chemical-enzymatic synthesis is 
carried out in which chemically synthesized oligonucleotides 
are used in ligation reactions and/or in the polymerase chain 
reaction to create longer polynucleotides. In a most 
preferred embodiment, the polymerase chain reaction is carried 
out using overlapping primers chosen so that the result of the 
amplification is a DNA with the sequence desired for the 
artificial gene. The oligonucleotides of the present 
invention may be synthesized in solid phase or in solution. 
Generally, solid phase synthesis is preferred. Detailed 
descriptions of the procedures for solid phase synthesis of 
oligonucleotides by phosphite-triester , phosphotriester , and 
H-phosphonate chemistries are widely available. See, for 
example, Itakura, U.S. Pat. No. 4,401,796; Caruthers et al . , 
U.S. Pat. Nos. 4,458,066 and 4,500,707; Beaucage et al . , 
Tetrahedron Lett., 22:1859-1862; Matteucci et al . , J . Amer. 
Chem. Soc, 103:3185-3191 (1981); Caruthers et al., Genetic 
Engineering, 4:1-17 (1982); Jones, chapter 2, Atkinson et al . , 
chapter 3, and Sproat et al . , chapter 4, in Gait, ed. 
Oligonucleotide Synthesis : A Practical Approach, IRL Press, 
Washington, D.C. (1984); Froehler et al . , Tetrahedron Lett., 
27:469-472 (1986); Froehler et al . , Nucleic Acids Res., 
14:5399-5407 (1986); Sinha et al . , Tetrahedron Lett., 24:5843- 
5846 (1983); and Sinha et al . , Nucleic Acids Res., 12:4539- 
4557 (1984) which are incorporated herein by reference. 

The artificial gene can introduced into a cell and 
expressed to produce an immunoglobulin polypeptide. The 
choice of cell type for expression will depend on many factors 
(e.g., the level of protein glycosylation desired), but cells 
capable of secreting human immunoglobulins will be preferred. 
Especially preferred cells include CHO cells and myeloma- 
derived cells such as the SP20 and NS0 cell lines. Standard 
cell culture are well known and are also described in Newman, 
et al., Biotechnology, 10:1455-1460 (1992); Bebbington, et 



al., Biotechnology, 10:169-175 (1992); Cockett, et al., 
Biotechnology, 8:662-667 (1990); Carter, et al w 
Biotechnology, 10:163-167 (1992), each of which is 
incorporated herein by reference. Methods for introduction of 
nucleic acids, e.g., an artificial gene, are well known and 
include transfection (e.g., by electroporation or liposome- 
mediated) and transformation. Systems for expression of 
introduced genes are described generally in Sambrook et al., 
supra . 

It is often desirable to express two immunoglobulin 
polypeptides (i.e., a heavy chain and a light chain) in the 
same cell so that an immunoglobulin (e.g., an IgG molecule) is 
produced in vivo. Accordingly it will sometimes be desirable 
to introduce two artificial genes (i.e., one encoding a heavy 
chain and one encoding a light chain) into a cell. (The two 
artificial genes can be introduced on a single vector) . 
Alternatively, one artificial gene encoding one immunoglobulin 
polypeptide can be introduced into a cell that has been 
genetically engineered to express the other immunoglobulin 
polypeptide. 

It will be apparent that as the cells into which the 
artificial gene is transfected propagate, the wholly synthetic 
nucleic acid portion of the artificial gene, will act as a 
template for replication and transcription. Nonetheless, the 
progeny genes will have originated from a synthetic nucleic 
acid (i.e., a polypeptide-encoding nucleic acid molecule that 
is synthesized in vitro by chemical or enzymatic methods that 
do not require a cell-derived template nucleic acid strand) 
and as used herein, are also considered artificial genes. 
Thus, the relationship of the synthetic portion of the 
artificial gene to the expressed transgene of the hybridoma is 
one in which there is an informational link (i.e., sequence 
information) but no direct physical link. 

The invention also provides anti-CD4 monoclonal 
antibodies useful in therapeutic and diagnostic applications, 
especially the treatment of human disease. CD 4 is a cell 
surface protein that is expressed primarily on thymocytes and 
T cells, and which is involved in T-cell function and MHC 
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Class II recognition of antigen. Antibodies directed against 
CD4 act to reduce the activity of CD4 cells and thus reduce 
undesirable autoimmune reactions, inflammatory responses and 
rejection of transplanted organs. 

Indeed, administration of anti-CD4 mABs has been 
shown to prevent (Wofsy, et al., J. Exp. Med., 161:378-391 
(1985)) or reverse (Wofsy, et al., J. Immunol., 138:3247-3253 
(1987), Waldor, et al., Science, 227:415-417 (1985)) 
autoimmune disease in animal models. Administration of murine 
or chimeric anti-CD4 mAbs to patients with rheumatoid 
arthritis has shown evidence of clinical benefit (Knox, et 
al., Blood, 77:20-30 (1991); Goldbery, et al. , J. 
Autoimmunity, 4:617-630; Herzog, et al. , Lancet, ii: 1461-1462 
; Horneff, et al. , Arthritis Rheum., 34:129-140; Reiter, et 
al., Arthritis Rheum., 34:525-536; Wending, et al. , J. Rheum., 
18:325-327; Van der Lubbe, et al. , Arthritis Rheum., 38:1097- 
1106; Van der Lubbe, et al. , Arthritis Rheum., 36:1375-1379; 
Moreland, et al. , Arthritis Rheum., 36:307-318, and Choy, et 
al., Arthritis and Rheumatism, 39(l):52-56 (1996); all of 
which is incorporated herein by reference) . In addition, as 
noted above, a chimeric anti-CD4 mAB has shown some clinical 
efficacy in patients with mycosis fungoides (Knox et al. 
(1991) Blood 77:20; which is incorporated herein by 
reference) . Anti-CD4 antibodies are also discussed in Newman, 
et al., Biotechnology, 10:1455-1460 (1992), which is 
incorporated herein by reference. 

EXPERIMENTAL EXAMPLES 

METHODS AND MATERIALS 

Transgenic mice are derived according to Hogan, et 
al., "Manipulating the Mouse Embryo: A Laboratory Manual", 
Cold Spring Harbor Laboratory, which is incorporated herein by 
reference. 

Embryonic stem cells are manipulated according to 
published procedures (Teratocarcinomas and embryonic stem 
cells: a practical approach, E.J. Robertson, ed. , IRL Press, 
Washington, D.C., 1987; Zjilstra et al., Nature 342 : 435-438 
(1989); and Schwartzberg et al., Science 246 :799-803 (1989), 
each of which is incorporated herein by reference) . 



DNA cloning procedures are carried out according to 
J. Sambrook, et al. in Molecular Cloning: A Laboratory 
Manual, 2d ed. , 1989, Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, N.Y., which is incorporated herein by 
reference. 

Oligonucleotides are synthesized on an Applied Bio 
Systems oligonucleotide synthesizer according to 
specifications provided by the manufacturer. 

Hybridoma cells and antibodies are manipulated 
according to "Antibodies: A Laboratory Manual", Ed Harlow and 
David Lane, Cold Spring Harbor Laboratory (1988) , which is 
incorporated herein by reference. 

EXAMPLE 1 

Genomic Heavy Chain Human la Transcrene 
This Example describes the cloning and 
microinjection of a human genomic heavy chain immunoglobulin 
transgene which is microinjected into a murine zygote. 

Nuclei are isolated from fresh human placental 
tissue as described by Marzluff et al., "Transcription and 
Translation: A Practical Approach", B.D. Hammes and 
S.J. Higgins, eds. , pp. 89-129, 1RL Press, Oxford (1985)). 
The isolated nuclei (or PBS washed human spermatocytes) are 
embedded in a low melting point agarose matrix and lysed with 
EDTA and proteinase k to expose high molecular weight DNA, 
which is then digested in the agarose with the restriction 
enzyme NotI as described by M. Finney in Current Protocols in 
Molecular Biology (F. Ausubel, et al., eds. John Wiley & Sons, 
Supp. 4, 1988, Section 2.5.1). 

The NotI digested DNA is then fractionated by pulsed 
field gel electrophoresis as described by Anand et al., 
Nucl. Acids Res , 17.: 3425-3433 (1989). Fractions enriched for 
the NotI fragment are assayed by Southern hybridization to 
detect one or more of the sequences encoded by this fragment. 
Such sequences include the heavy chain D segments, J segments, 
y, and 71 constant regions together with representatives of all 
6 VH families (although this fragment is identified as 670 kb 
fragment from HeLa cells by Berman et al. (1988), supra., we 
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have found it to be as 83 0 kb fragment from human placental an 
sperm DNA) . Those fractions containing this NotI fragment 
(see Fig- 4) are pooled and cloned into the NotI site of the 
vector pYACNN in Yeast cells. Plasmid pYACNN is prepared by 
digestion of pYAC-4 Neo (Cook et al., Nucleic Acids Res. 16: 
11817 (1988)) with EcoRI and ligation in the presence of the 
oligonucleotide 5' - AAT TGC GGC CGC - 3'. 

YAC clones containing the heavy chain NotI fragment 
are isolated as described by Brownstein et al., Science 
244 : 1348-1351 (1989), and Green et al., Proc. Natl. Acad. Sci. 
USA 87:1213-1217 (1990), which are incorporated herein by 
reference. The cloned NotI insert is isolated from high 
molecular weight yeast DNA by pulse field gel electrophoresis 
as described by M. Finney, op cit. The DNA is condensed by 
the addition of 1 mM spermine and microinjected directly into 
the nucleus of single cell embryos previously described. 

EXAMPLE 2 

Genomic k Light Chain Human Ig Transgene 
Formed by In Vivo Homologous Recombination 

A map of the human k light chain has been described 
in Lorenz et al. , Nucl. Acids Res. 15:9667-9677 (1987), which 
is incorporated herein by reference. 

A 450 kb Xhol to NotI fragment that includes all of 
C K , the 3' enhancer, all J segments, and at least five 
different V segments is isolated and microinjected into the 
nucleus of single cell embryos as described in Example 1. 

EXAMPLE 3 

Genomic k Light Chain Human Ig Transgene 
Formed by In Vivo Homologous Recombination 

A 750 kb Mlul to NotI fragment that includes all of 
the above plus at least 2 0 more V segments is isolated as 
described in Example 1 and digested with BssHII to produce a 
fragment of about 400 kb. 

The 450 kb Xhol to NotI fragment plus the 
approximately 400 kb Mlul to BssHII fragment have sequence 
overlap defined by the BssHII and Xhol restriction sites. 
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Homologous recombination of these two fragments upon 
microinjection of a mouse zygote results in a transgene 
containing at least an additional 15-20 V segments over that 
found in the 450 kb XhoI/NotI fragment (Example 2) . 

EXAMPLE 4 

Construction of Heavy Chain Mini-Locus 
A. Construction of pGPI and PGP2 

pBR322 is digested with EcoRI and Styl and ligated 
with the following oligonucleotides to generate pGPI which 
contains a 147 base pair insert containing the restriction 
sites shown in Fig. 8. The general overlapping of these 
oligos is also shown in Fig. 9. 

The oligonucleotides are: 

5 i _ CTT GAG CCC GCC TAA TGA GCG GGC TTT 
TTT TTG CAT ACT GCG GCC - 3 1 

5 1 - GCA ATG GCC TGG ATC CAT GGC GCG CTA 
GCA TCG ATA TCT AGA GCT CGA GCA -3 1 

5' - TGC AGA TCT GAA TTC CCG GGT ACC AAG 
CTT ACG CGT ACT AGT GCG GCC GCT -3 • 

5 1 - AAT TAG CGG CCG CAC TAG TAC GCG TAA 
GCT TGG TAC CCG GGA ATT - 3 1 

5 1 - CAG ATC TGC ATG CTC GAG CTC TAG ATA 
TCG ATG CTA GCG CGC CAT GGA TCC - 3 1 

5 1 - AGG CCA TTG CGG CCG CAG TAT GCA AAA 
AAA AGC CCG CTC ATT AGG CGG GCT - 3 • 

This plasmid contains a large polylinker flanked by 
rare cutting NotI sites for building large inserts that can be 
isolated from vector sequences for microinjection. The 
plasmid is based on pBR3 22 which is relatively low copy 
compared to the pUC based plasmids (pGPI retains the pBR3 22 
copy number control region near the origin of replication) . 
Low copy number reduces the potential toxicity of insert 
sequences. In addition, pGPI contains a strong transcription 
terminator sequence derived from trpA (Christie et al., Proc. 
Natl . Acad . Sci . USA 78:4180 (1981)) inserted between the 
ampicillin resistance gene and the polylinker. This further 
reduces the toxicity associated with certain inserts by 



oligo-1 
oligo-2 
oligo-3 
oligo-4 
oligo-5 
oligo-6 



102 

preventing readthrough transcription coming from the 
ampicillin promoters. 

Plasmid pGP2 is derived from pGPl to introduce an 
additional restriction site (Sfil) in the polylinker. pGPl is 
digested with Mlul and Spel to cut the recognition sequences 
in the polylinker portion of the plasmid. 

The following adapter oligonucleotides are ligated 
to the thus digested pGPl to form pGP2 . 

5 ' CGC GTG GCC GCA ATG GCC A 3 * 
5 1 CTA GTG GCC ATT GCG GCC A 3 1 

pGP2 is identical to pGPl except that it contains an 
additional Sfi I site located between the Mlul and Spel sites. 
This allows inserts to be completely excised with Sfil as well 
as with NotI . 

B. Construction of pRE3 (rat enhancer 3') 

An enhancer sequence located downstream of the rat 
constant region is included in the heavy chain constructs. 

The heavy chain region 3 1 enhancer described by 
Petterson et al., Nature 344 : 165-168 (1990), which is 
incorporated herein by reference) is isolated and cloned. The 
rat IGH 3 1 enhancer sequence is PCR amplified by using the 
following oligonucleotides: 

5 1 CAG GAT CCA GAT ATC AGT ACC TGA AAC AGG GCT TGC 3 1 
5 ' GAG CAT GCA CAG GAC CTG GAG CAC ACA CAG CCT TCC 3 ' 

The thus formed double stranded DNA encoding the 3 • 
enhancer is cut with BamHI and SphI and clone into BamHI/SphI 
cut pGP2 to yield pRE3 (rat enhancer 3'). 

C. Cloning of Human J-u Region 

A substantial portion of this region is cloned by 
combining two or more fragments isolated from phage lambda 
inserts. See Fig. 9. 



A 6,3 kb BamHI/Hindlll fragment that includes all 
human J segments (Matsuda et al. , EMBO J. . 7:1047-1051 (1988); 
Ravetech et al.m Cell . 27:583-591 (1981), which are 
incorporated herein by reference) is isolated from human 
genomic DNA library using the oligonucleotide GGA CTG TGT CCC 
TGT GTG ATG CTT TTG ATG TCT GGG GCC AAG. 

An adjacent 10 kb Hindlll/Bamll fragment that 
contains enhancer, switch and constant region coding exons 
(Yasui et al., Eur. J. Immunol. 19:1399-1403 (1989)) is 
similarly isolated using the oligonucleotide: 
CAC CAA GTT GAC CTG CCT GGT CAC AGA CCT GAC CAC CTA TGA 

An adjacent 3 1 1.5 kb BamHI fragment is similarly 
isolated using clone pMUM insert as probe (pMUM is 4 kb 
EcoRI/Hindlll fragment isolated from human genomic DNA library 
with oligonucleotide: 

CCT GTG GAC CAC CGC CTC CAC CTT CAT 

CGT CCT CTT CCT CCT 
Hiu membrane exon 1) and cloned into pUC19. 

pGPl is digested with BamHI and Bglll followed by 
treatment with calf intestinal alkaline phosphatase. 

Fragments (a) and (b) from Fig. 9 are cloned in the 
digested pGPl. A clone is then isolated which is oriented 
such that 5 f BamHI site is destroyed by BamHI/Bgl fusion. It 
is identified as pMU (see Fig. 10). pMU is digested with 
BamHI and fragment (c) from Fig. 9 is inserted. The 
orientation is checked with Hindlll digest. The resultant 
plasmid pHIGl (Fig. 10) contains an 18 kb insert encoding J 
and Cm segments. 

D. Cloning of Cu Region 

pGPl is digested with BamHI and Hindlll is followed 
by treatment with calf intestinal alkaline phosphatase (Fig. 
14) . The so treated fragment (b) of Fig. 14 and fragment (c) 
of Fig. 14 are cloned into the BamHI/Hindlll cut pGPl. Proper 
orientation of fragment (c) is checked by Hindlll digestion to 
form pCONl containing a 12 kb insert encoding the C/x region. 

Whereas pHIGl contains J segments, switch and (jl 
sequences in its 18 kb insert with an Sfil 3 f site and a Spel 
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5' site in a polylinker flanked by NotI sites, will be used 
for rearranged VDJ segments. pCONl is identical except that 
it lacks the J region and contains only a 12 kb insert. The 
use of pCONl in the construction of fragment containing 
rearranged VDJ segments will be described hereinafter. 

E. Cloning of 7-1 Constant Region fpREG2) 

The cloning of the human 7-1 region is depicted in 

Fig. 16. 

Yamamura et al., Proc . Natl . Acad . Sci . USA 
83.: 2152-2156 (1986) reported the expression of membrane bound 
human 7-1 from a transgene construct that had been partially 
deleted on integration. Their results indicate that the 3' 
BamHI site delineates a sequence that includes the 
transmembrane rearranged and switched copy of the gamma gene 
with a V-C intron of less than 5kb. Therefore, in the 
unrearranged, unswitched gene, the entire switch region is 
included in a sequence beginning less than 5 kb from the 5 1 
end of the first 7-1 constant exon. Therefore it is included 
in the 5' 5.3 kb Hindlll fragment (Ellison et al.. Nucleic 
Acids Res. 10:4071-4079 (1982), which is incorporated herein 
by reference). Takahashi et al. , Cell 29 : 671-679 (1982), 
which is incorporated herein by reference, also reports that 
this fragment contains the switch sequence, and this fragment 
together with the 7.7 kb Hindlll to BamHI fragment must 
include all of the sequences we need for the transgene 
construct. An intronic sequence is a nucleotide sequence of 
at least 15 contiguous nucleotides that occurs in an intron of 
a specified gene. 

Phage clones containing the 7-1 region are 
identified and isolated using the following oligonucleotide 
which is specific for the third exon of 7-1 (CH3) . 

5' TGA GCC ACG AAG ACC CTG AGG 

TCA AGT TCA ACT GGT ACG TGG 3 1 

A 7.7 kb Hindlll to Bglll fragment (fragment (a) in 
Fig. 11) is cloned into Hindlll/Bglll cut pRE3 to form pREGl . 
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The upstream 5.3 kb Hindlll fragment (fragment (b) in Fig. 11) 
is cloned into Hindlll digested pREGl to form pREG2 . Correct 
orientation is confirmed by BamHI/Spel digestion. 

5 F. Combining C7 and Cu 

The previously described plasmid pHIGl contains 
human J segments and the Cm constant region exons. To provide 
a transgene containing the C/x constant region gene segments, 
pHIGl was digested with Sfil (Fig. 10) . The plasmid pREG2 was 

10 also digested with Sfil to produce a 13.5 kb insert containing 
human C7 exons and the rat 3' enhancer sequence. These 
sequences were combined to produce the plasmid pHIG3 1 (Fig. 
12) containing the human J segments, the human C/x constant 
region, the human C7I constant region and the rat 3' enhancer 

15 contained on a 31.5 kb insert. 

A second plasmid encoding human C/i and human C7I 
without J segments is constructed by digesting pCONl with Sfil 
and combining that with the Sfil fragment containing the human 
C7 region and the rat 3 f enhancer by digesting pREG2 with 

2 0 Sfil. The resultant plasmid, pCON (Fig. 12) contains a 26 kb 

Notl/Spel insert containing human Qu, human 71 and the rat 3 1 
enhancer sequence . 

G. Cloning of D Segment 
25 The strategy for cloning the human D segments is 

depicted in Fig. 13. Phage clones from the human genomic 
library containing D segments are identified and isolated 
using probes specific for diversity region sequences (Ichihara 
et al., EMBQ J . 7:4141-4150 (1988)). The following 

3 0 oligonucleotides are used: 

DXP1 : 5 1 - TGG TAT TAC TAT GGT TCG GGG AGT TAT TAT 

AAC CAC AGT GTC - 3 ' 

35 DXP4: 5' - GCC TGA AAT GGA GCC TCA GGG CAC AGT GGG 

CAC GGA CAC TGT - 3 1 

DN4: 5' - GCA GGG AGG ACA TGT TTA GGA TCT GAG GCC 
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GCA CCT GAC ACC - 3 1 

A 5.2 kb Xhol fragment (fragment (b) in Fig. 13) 
containing DLR1, DXP1, DXP'l, and DAI is isolated from a phage 
clone identified with oligo DXP1. 

A 3.2 kb Xbal fragment (fragment (c) in Fig. 13) 
containing DXP 4 , DA4 and DK4 is isolated from a phage clone 
identified with oligo DXP 4 . 

Fragments (b) , (c) and (d) from Fig. 13 are combined 
and cloned into the Xbal /Xhol site of pGPl to form pHIG2 which 
contains a 10.6 kb insert. 

This cloning is performed sequentially. First, the 
5.2 kb fragment (b) in Fig. 13 and the 2.2 kb fragment (d) of 
Fig. 13 are treated with calf intestinal alkaline phosphatase 
and cloned into pGPl digested with Xhol and Xbal. The 
resultant clones are screened with the 5.2 and 2.2 kb insert. 
Half of those clones testing positive with the 5.2 and 2.2 kb 
inserts have the 5.2 kb insert in the proper orientation as 
determined by BamHI digestion. The 3.2 kb Xbal fragment from 
Fig. 13 is then cloned into this intermediate plasmid 
containing fragments (b) and (d) to form pHIG2 . This plasmid 
contains diversity segments cloned into the polylinker with a 
unique 5 1 Sfil site and unique 3' Spel site. The entire 
polylinker is flanked by NotI sites. 

H. Construction of Heavy Chain Minilocus 

The following describes the construction of a human 
heavy chain mini-locus which contain one or more V segments. 

An unrearranged V segment corresponding to that 
identified as the V segment contained in the hybridoma 
of Newkirk et al., J. Clin. Invest. 8JL: 1511-1518 (1988), which 
is incorporated herein by reference, is isolated using the 
following oligonucleotide: 

5 1 - GAT CCT GGT TTA GTT AAA GAG GAT TTT 
ATT CAC CCC TGT GTC - 3 • 
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A restriction map of the unrearranged V segment is 
determined to identify unique restriction sites which provide 
upon digestion a DNA fragment having a length approximately 2 
kb containing the unrearranged V segment together with 5 ' and 
5 3 1 flanking sequences. The 5 f prime sequences will include 
promoter and other regulatory sequences whereas the 3 1 
flanking sequence provides recombination sequences necessary 
for V-DJ joining. This approximately 3 . 0 kb V segment insert 
is cloned into the polylinker of pGB2 to form pVHl. 

10 pVHl is digested with Sfil and the resultant 

fragment is cloned into the Sfil site of pHIG2 to form a 
pHIGS'. Since pHIG2 contains D segments only, the resultant 
pHIG5' plasmid contains a single V segment together with D 
segments. The size of the insert contained in pHIGS is 10.6 

15 kb plus the size of the V segment insert. 

The insert from pHIGS is excised by digestion with 
NotI and Spel and isolated. pHIG3 1 which contains J , C/x and 
C7I segments is digested with Spel and NotI and the 3 1 kb 
fragment containing such sequences and the rat 3 1 enhancer 

2 0 sequence is isolated. These two fragments are combined and 

ligated into NotI digested pGPl to produce pHIG which contains 
insert encoding a V segment, nine D segments, six functional J 
segments, C/x, C7 and the rat 3' enhancer. The size of this 
insert is approximately 43 kb plus the size of the V segment 
25 insert. 

I. Construction of Heavy Chain Minilocus 

by Homologous Recombination 

As indicated in the previous section, the insert of 
30 pHIG is approximately 43 to 45 kb when a single V segment is 
employed. This insert size is at or near the limit of that 
which may be readily cloned into plasmid vectors. In order to 
provide for the use of a greater number of V segments, the 
following describes in vivo homologous recombination of 

3 5 overlapping DNA fragments which upon homologous recombination 

within a zygote or ES cell form a transgene containing the rat 
3 1 enhancer sequence, the human C/x, the human C7I, human J 
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segments, human D segments and a multiplicity of human V 
segments . 

A 6.3 kb BamHI/Hindlll fragment containing human J 
segments (see fragment (a) in Fig. 9) is cloned into Mlul/Spel 
digested pHIG5 1 using the following adapters: 

5 1 GAT CCA AGC AGT 3 1 
5 1 CTA GAC TGC TTG 3 ' 

5 • CGC GTC GAA CTA 3 » 

5 1 AGC TTA GTT CGA 3 f 

The resultant is plasmid designated pHIG5 f O 
(overlap) . The insert contained in this plasmid contains 
human V, D and J segments. When the single V segment from 
pVHl is used, the size of this insert is approximately 17 kb 
plus 2 kb. This insert is isolated and combined with the 
insert from pHIG3 1 which contains the human J, Cm, T 1 and rat 
3 f enhancer sequences. Both inserts contain human J segments 
which provide for approximately 6.3 kb of overlap between the 
two DNA fragments. When coinjected into the mouse zygote, in 
vivo homologous recombination occurs generating a transgene 
equivalent to the insert contained in pHIG. 

This approach provides for the addition of a 
multiplicity of V segments into the transgene formed in vivo* 
For example, instead of incorporating a single V segment into 
pHIG5', a multiplicity of V segments contained on (1) isolated 
genomic DNA, (2) ligated DNA derived from genomic DNA, or (3) 
DNA encoding a synthetic V segment repertoire is cloned into 
pHIG2 at the Sfil site to generate pHIG5 ' V N . The J segments 
fragment (a) of Fig. 9 is then cloned into pHIGS ' V N and the 
insert isolated. This insert now contains a multiplicity of V 
segments and J segments which overlap with the J segments 
contained on the insert isolated from pHIG3 1 . When 
cointroduced into the nucleus of a mouse zygote, homologous 
recombination occurs to generate in vivo the transgene 
encoding multiple V segments and multiple J segments, multiple 
D segments, the C\x region, the C7I region (all from human) and 
the rat 3* enhancer sequence. 
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EXAMPLE 5 

Construction of Light: Chain Minilocus 

A. Construction of pEul 

The construction of pEjxl is depicted in Fig. 16. 
5 The mouse heavy chain enhancer is isolated on the Xbal to 

EcoRI 678 bp fragment (Banerji et al., Cell 32: 729-740 (1983)) 
from phage clones using oligo: 

5' GAA TGG GAG TGA GGC TCT CTC ATA CCC 
10 TAT TCA GAA CTG ACT 3 • 

This EfM fragment is cloned into EcoRV/Xbal digested 
pGPl by blunt end filling in EcoRI site. The resultant 
15 plasmid is designated pEmul. 

B. Construction Of k Light chain Minilocus 

The k construct contains at least one human V K 
segment, all five human J K segments, the human J-C K enhancer, 

20 human k constant region exon, and, ideally, the human 3' k 
enhancer (Meyer et al., EMBO J . 8:1959-1964 (1989)). The k 
enhancer in mouse is 9 kb downstream from C K . However, it is 
as yet unidentified in the human. In addition, the construct 
contains a copy of the mouse heavy chain J-C/z enhancers. 

25 The minilocus is constructed from four component 

fragments : 

(a) A 16 kb Smal fragment that contains the human 
C K exon and the 3 1 human enhancer by analogy with the mouse 
locus ; 

30 (b) A 5 1 adjacent 5 kb Smal fragment, which 

contains all five J segments; 

(c) The mouse heavy chain intronic enhancer 
isolated from pE/xl (this sequence is included to induce 
expression of the light chain construct as early as possible 

35 in B-cell development. Because the heavy chain genes are 
transcribed earlier than the light chain genes, this heavy 
chain enhancer is presumably active at an earlier stage than 
the intronic k enhancer) ; and 

(d) A fragment containing one or more V segments. 
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The preparation of this construct is as follows. 
Human placental DNA is digested with Smal and fractionated on 
agarose gel by electrophoresis* Similarly, human placental 
DNA is digested with BamHI and fractionated by 
5 electrophoresis. The 16 kb fraction is isolated from the Smal 
digested gel and the 11 kb region is similarly isolated from 
the gel containing DNA digested with BamHI. 

The 16 kb Smal fraction is cloned into Lambda FIX II 
(Stratagene, La Jolla, California) which has been digested 
10 with Xhol, treated with klenow fragment DNA polymerase to fill 
in the Xhol restriction digest product. Ligation of the 16 kb 
Smal fraction destroys the Smal sites and lases Xhol sites 
intact. 

The 11 kb BamHI fraction is cloned into X EMBL3 
15 (Strategene, La Jolla, California) which is digested with 
BamHI prior to cloning. 

Clones from each library were probed with the Ck 
specific oligo: 

2 0 5* GAA CTG TGG CTG CAC CAT CTG TCT 

TCA TCT TCC CGC CAT CTG 3 1 

A 16 kb Xhol insert that was subcloned into the Xhol 
25 cut pE/xl so that Ck is adjacent to the Smal site. The 
resultant plasmid was designated pKapl. 

The above Ck specific oligonucleotide is used to 
probe the X EMBL3 /BamHI library to identify an 11 kb clone. A 
5 kb Smal fragment (fragment (b) in Fig. 20) is subcloned and 

3 0 subsequently inserted into pKapl digested with Smal. Those 

plasmids containing the correct orientation of J segments, Ck 
and the Ejti enhancer are designated pKap2 . 

One or more Vk segments are thereafter subcloned 
into the Mlul site of pKap2 to yield the plasmid pKapH which 

3 5 encodes the human Vk segments, the human Jk segments, the 

human Ck segments and the human E^t enhancer. This insert is 
excised by digesting pKapH with NotI and purified by agarose 
gel electrophoresis . The thus purified insert is 
microinjected into the pronucleus of a mouse zygote as 

40 previously described. 
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C. Construction of k Light Chain Minilocus by 
In Vivo Homologous Recombination 

The 11 kb BamHI fragment is cloned into BamHI 
5 digested pGPl such that the 3 f end is toward the Sfil site. 
The resultant plasmid is designated pKAPint. One or more Vk 
segments is inserted into the polylinker between the BamHI and 
Spel sites in pKAPint to form pKapHV. The insert of pKapHV is 
excised by digestion with NotI and purified. The insert from 

10 pKap2 is excised by digestion with NotI and purified. Each of 
these fragments contain regions of homology in that the 
fragment from pKapHV contains a 5 kb sequence of DNA that 
include the J K segments which is substantially homologous to 
the 5 kb Smal fragment contained in the insert obtained from 

15 pKap2. As such, these inserts are capable of homologous ly 
recombining when microinjected into a mouse zygote to form a 
transgene encoding V K/ J K and C K . 

EXAMPLE 6 

20 

Isolation of Genomic Clones 
Corresponding to Rearranged and Expressed 
Copies of Immunoglobulin k Light Chain Genes 

25 This example describes the cloning of immunoglobulin 

k light chain genes from cultured cells that express an 
immunoglobulin of interest. Such cells may contain multiple 
alleles of a given immunoglobulin gene. For example, a 
hybridoma might contain four copies of the k light chain gene, 

3 0 two copies from the fusion partner cell line and two copies 
from the original B-cell expressing the immunoglobulin of 
interest. Of these four copies, only one encodes the 
immunoglobulin of interest, despite the fact that several of 
them may be rearranged. The procedure described in this 

35 example allows for the selective cloning of the expressed copy 
of the k light chain. 

A. Double Stranded cDNA 

Cells from human hybridoma, or lymphoma, or other 
40 cell line that synthesizes either cell surface or secreted or 
both forms of IgM with a k light chain are used for the 
isolation of polyA-h RNA. The RNA is then used for the 
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synthesis of oligo dT primed cDNA using the enzyme reverse 
transcriptase (for general methods see , Goodspeed et al. 
(1989) Gene 76 : 1; Dunn et al. (1989) J. Biol. Chem. 264 : 
13 057) . The single stranded cDNA is then isolated and G 
5 residues are added to the 3* end using the enzyme 
polynucleotide terminal transferase. The G-tailed 
single-stranded cDNA is then purified and used as template for 
second strand synthesis (catalyzed by the enzyme DNA 
polymerase) using the following oligonucleotide as a primer: 

10 

5 1 - GAG GTA CAC TGA CAT ACT GGC ATG CCC 
CCC CCC CCC - 3 1 



15 The double stranded cDNA is isolated and used for 

determining the nucleotide sequence of the 5 1 end of the mRNAs 
encoding the heavy and light chains of the expressed 
immunoglobulin molecule. Genomic clones of these expressed 
genes are then isolated. The procedure for cloning the 

20 expressed light chain gene is outlined in part B below. 

B. Light Chain 

The double stranded cDNA described in part A is 
denatured and used as a template for a third round of DNA 
25 synthesis using the following oligonucleotide primer: 

5 1 - GTA CGC CAT ATC AGC TGG ATG AAG TCA TCA GAT 

GGC GGG AAG ATG AAG ACA GAT GGT GCA - 3 » 

30 

This primer contains sequences specific for the 
constant portion of the k light chain message (TCA TCA GAT GGC 
GGG AAG ATG AAG ACA GAT GGT GCA) as well as unique sequences 
that can be used as a primer for the PCR amplification of the 
35 newly synthesized DNA strand (GTA CGC CAT ATC AGC TGG ATG 

AAG) . The sequence is amplified by PGR using the following 
two oligonucleotide primers: 
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5 f - GAG GTA CAC TGA CAT ACT GGC ATG -3 1 
5 1 - GTA CGC CAT ATC AGC TGG ATG AAG -3 f 
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The PCR amplified sequence is then purified by gel 
electrophoresis and used as template for dideoxy sequencing 
reactions using the following oligonucleotide as a primer: 

5 1 - GAG GTA CAC TGA CAT ACT GGC ATG -3 ' 

The first 42 nucleotides of sequence will then be 
used to synthesize a unique probe for isolating the gene from 
which immunoglobulin message was transcribed. This synthetic 
42 nucleotide segment of DNA will be referred to below as 
o-kappa . 

A Southern blot of DNA, isolated from the Ig 
expressing cell line and digested individually and in pairwise 
combinations with several different restriction endonucleases 
including Smal, is then probed with the 3 2-P labelled unique 
oligonucleotide o-kappa. A unique restriction endonuclease 
site is identified upstream of the rearranged V segment. 

DNA from the Ig expressing cell line is then cut 
with Smal and second enzyme (or BamHI or Kpnl if there is Smal 
site inside V segment) . Any resulting non-blunted ends are 
treated with the enzyme T4 DNA polymerase to give blunt ended 
DNA molecules. Then add restriction site encoding linkers 
(BamHI, EcoRI or Xhol depending on what site does not exist in 
fragment) and cut with the corresponding linker enzyme to give 
DNA fragments with BamHI, EcoRI or Xhol ends. The DNA is then 
size fractionated by agarose gel electrophoresis, and the 
fraction including the DNA fragment covering the expressed V 
segment is cloned into lambda EMBL3 or Lambda FIX (Stratagene, 
La Jolla, California) . V segment containing clones are 
isolated using the unique probe o-kappa. DNA is isolated from 
positive clones and subcloned into the polylinker of pKapl. 
The resulting clone is called pRKL. 

EXAMPLE 7 

Isolation of Genomic Clones 
Corresponding to Rearranged Expressed Copies 
of Immunoglobulin Heavy Chain u Genes 

This example describes the cloning of immunoglobulin 

heavy chain /x genes from cultured cells of expressed and 
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immunoglobulin of interest. The procedure described in this 
example allows for the selective cloning of the expressed copy 
of a m heavy chain gene. 

Double-stranded cDNA is prepared and isolated as 
described herein before* The double-stranded cDNA is 
denatured and used as a template for a third round of DNA 
synthesis using the following oligonucleotide primer: 

5 1 - GTA CGC CAT ATC AGC TGG ATG AAG ACA GGA GAC 

GAG GGG GAA AAG GGT TGG GGC GGA TGC - 3 1 

This primer contains sequences specific for the 
constant portion of the n heavy chain message (ACA GGA GAC GAG 
GGG GAA AAG GGT TGG GGC GGA TGC) as well as unique sequences 
that can be used as a primer for the PCR amplification of the 
newly synthesized DNA strand (GTA CGC CAT ATC AGC TGG ATG 
AAG) . The sequence is amplified by PCR using the following 
two oligonucleotide primers: 

5 1 - GAG GTA CAC TGA CAT ACT GGC ATG - 3 f 
5 • - GTA CTC CAT ATC AGC TGG ATG AAG - 3 1 

The PCR amplified sequence is then purified by gel 
electrophoresis and used as template for dideoxy sequencing 
reactions using the following oligonucleotide as a primer: 

5 1 - GAG GTA CAC TGA CAT ACT GGC ATG - 3 ' 

The first 42 nucleotides of sequence are then used 
to synthesize a unique probe for isolating the gene from 
which immunoglobulin message was transcribed. This synthetic 
42 nucleotide segment of DNA will be referred to below as 
o-mu. 

A Southern blot of DNA, isolated from the Ig 
expressing cell line and digested individually and in pairwise 
combinations with several different restriction endonucleases 
including Mlul (Mlul is a rare cutting enzyme that cleaves 
between the J segment and mu CHI) , is then probed with the 
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32-P labelled unique oligonucleotide o-mu. A unique 
restriction endonuclease site is identified upstream of the 
rearranged V segment, 

DNA from the Ig expressing cell line is then cut 
with Mlul and second enzyme. Mlul or Spel adapter linkers are 
then ligated onto the ends and cut to convert the upstream 
site to Mlul or Spel. The DNA is then size fractionated by 
agarose gel electrophoresis, and the fraction including the 
DNA fragment covering the expressed V segment is cloned 
directly into the plasmid pGPI. V segment containing clones 
are isolated using the unique probe o-mu, and the insert is 
subcloned into Mlul or Mlul /Spel cut plasmid pC0N2 . The 
resulting plasmid is called pRMGH. 

EXAMPLE 8 

Construction of Human k Miniloci Transaenes 
Light Chain Minilocus 

A human genomic DNA phage library was screened with 
kappa light chain specific oligonucleotide probes and isolated 
clones spanning the J^-C region. A 5.7 kb Clal/Xhol fragment 
containing J K 1 together with a 13 kb Xhol fragment containing 
J K 2-5 and C K into pGPld was cloned and used to create the 
plasmid pKcor. This plasmid contains J K l-5, the kappa 
intronic enhancer and C K together with 4.5 kb of 5" and 9 kb of 
3' flanking sequences. It also has a unique 5' Xhol site for 
cloning V K segments and a unique 3 1 Sail site for inserting 
additional cis-acting regulatory sequences. 

V kappa genes 

A human genomic DNA phage library was screened with 
V K light chain specific oligonucleotide probes and isolated 
clones containing human V K segments. Functional V segments were 
identified by DNA sequence analysis. These clones contain 
TATA boxes, open reading frames encoding leader and variable 
peptides (including 2 cysteine residues) , splice sequences, 
and recombination heptamer-12 bp spacer-nonamer sequences. 
Three of the clones were mapped and sequenced. Two of the 
clones, 65.5 and 65.8 appear to be functional, they contain 
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TATA boxes, open reading frames encoding leader and variable 
peptides (including 2 cysteine residues) , splice sequences, 
and recombination heptamer-12 bp spacer-nonamer sequences. 
The third clone, 65.4, appears to encode a V K I pseudogene as it 
contains a non-canonical recombination heptamer. 

One of the functional clones, Vk 65-8, which encodes 
a Vklll family gene, was used to build a light chain minilocus 
construct . 

pKCl 

The kappa light chain minilocus transgene pKCl (Fig. 
32) was generated by inserting a 7 . 5 kb Xhol/Sall fragment 
containing V K 65.8 into the 5 1 Xhol site of pKcor. The 
transgene insert was isolated by digestion with NotI prior to 
injection. 

The purified insert was microinjected into the 
pronuclei of fertilized (C57BL/6 x CBA) F2 mouse embryos and 
transferred the surviving embryos into pseudopregnant females 
as described by Hogan et al. (in Methods of Manipulating the 
Mouse Embryo, 1986, Cold Spring Harbor Laboratory, New York) . 
Mice that developed from injected embryos were analyzed for 
the presence of transgene sequences by Southern blot analysis 
of tail DNA. Transgene copy number was estimated by band 
intensity relative to control standards containing known 
quantities of cloned DNA. Serum was isolated from these 
animals and assayed for the presence of transgene encoded 
human Ig kappa protein by ELISA as described by Harlow and 
Lane (in Antibodies: A Laboratory Manual, 1988, Cold Spring 
Harbor Laboratory, New York) . Microtiter plate wells were 
coated with mouse monoclonal antibodies specific for human Ig 
kappa (clone 6E1, #0173, AMAC, Inc., Westbrook, ME), human IgM 
(Clone AF6, #0285, AMAC, Inc., Westbrook, ME) and human IgGl 
(clone JL512, #0280, AMAC, Inc., Westbrook, ME) . Serum 
samples were serially diluted into the wells and the presence 
of specific immunoglobulins detected with affinity isolated 
alkaline phosphatase conjugated goat anti-human Ig 
(polyvalent) that had been pre-adsorbed to minimize cross- 
reactivity with mouse immunoglobulins. 
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Fig. 3 5 shows the results of an ELISA assay of serum 
from 8 mice (I.D. #676, 674, 673, 670, 666, 665, 664, and 
496) . The first seven of these mice developed from embryos 
that were injected with the pKCl transgene insert and the 
eighth mouse is derived from a mouse generated by 
microinjection of the pHCl transgene (described previously) . 
Two of the seven mice from KC1 injected embryos (I.D.#'s 666 
and 664) did not contain the transgene insert as assayed by 
DAN Southern blot analysis, and five of the mice (I.D.# f s 676, 
674, 673, 670, and 665) contained the transgene. All but one 
of the KC1 transgene positive animals express detectable 
levels of human Ig kappa protein, and the single non- 
expressing animal appears to be a genetic mosaic on the basis 
of DNA Southern blot analysis. The pHCl positive transgenic 
mouse expresses human IgM and IgGl but not Ig kappa, 
demonstrating the specificity of the reagents used in the 
assay. 

PKC2 

The kappa light chain minilocus transgene pKC2 was 
generated by inserting an 8 kb Xhol/Sall fragment containing 
65.5 into the 5* Xhol site of pKCl. The resulting transgene 
insert, which contains two V K segments, was isolated prior to 
microinjection by digestion with Notl. 

pKVe2 

This construct is identical to pKCl except that it 
includes 1.2 kb of additional sequence 5' of J K and is missing 
4.5 kb of sequence 3' of V K 65.8. In additional it contains a 
0.9 kb Xbal fragment containing the mouse heavy chain J-/i 
intronic enhancer (Banerji et al. , Cell 3_3:729-740 (1983)) 
together with a 1.4 kb Mlul/Hindlll fragment containing the 
human heavy chain J-jit intronic enhancer (Hayday et al . , Nature 
307 :334-340 (1984)) inserted downstream. This construct tests 
the feasibility of initiating early rearrangement of the light 
chain minilocus to effect allelic and isotypic exclusion. 
Analogous constructs can be generated with different 
enhancers, i.e., the mouse or rat 3' kappa or heavy chain 
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enhancer (Meyer and Neuberger, EMBO J . 8:1959-1964 (1989); 
Petterson et al. Nature 344 : 165-168 (1990), which are 
incorporated herein by reference) . 

Rearranged Light Chain Transgenes 

A kappa light chain expression cassette was designed 
to reconstruct functionally rearranged light chain genes that 
have been amplified by PCR from human B-cell DNA. The scheme 
is outlined in Fig. 33. PCR amplified light chain genes are 
cloned into the vector pK5nx that includes 3.7 kb of 5 1 
flanking sequences isolated from the kappa light chain gene 
65.5. The VJ segment fused to the 5' transcriptional 
sequences are then cloned into the unique Xhol site of the 
vector pK31s that includes J K 2-4 f the J K intronic enhancer, C K , 
and 9 kb of downstream sequences. The resulting plasmid 
contains a reconstructed functionally rearranged kappa light 
chain transgene that can be excised with NotI for 
microinjection into embryos. The plasmids also contain unique 
Sail sites at the 3 1 end for the insertion of additional cis- 
acting regulatory sequences. 

Two synthetic oligonucleotides (o-130, o-131) were 
used to amplify rearranged kappa light chain genes from human 
spleen genomic DNA. Oligonucleotide o-131 (gga ccc aga 
(g,c)gg aac cat gga a (g, a) (g, a, t , c) ) is complementary to the 
5* region of V^III family light chain genes and overlaps the 
first ATC of the leader sequence. Oligonucleotide o-13 0 (gtg 
caa tea att etc gag ttt gac tac aga c) is complementary to a 
sequence approximately 150 bp 3 1 of J K 1 and includes an Xhol 
site. These two oligonucleotides amplify a 0.7 kb DNA 
fragment from human spleen DNA corresponding to rearranged 
V^III genes joined to J K 1 segments. The PCR amplified DNA was 
digested with Ncol and Xhol and cloned individual PCR products 
into the plasmid pNN03. The DNA sequence of 5 clones was 
determined and identified two with functional VJ joints (open 
reading frames) . Additional functionally rearranged light 
chain clones are collected. The functionally rearranged clones 
can be individually cloned into light chain expression 
cassette described above (Fig. 33) . Transgenic mice generated 
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with the rearranged light chain constructs can be bred with 
heavy chain minilocus transgenics to produce a strain of mice 
that express a spectrum of fully human antibodies in which all 
of the diversity of the primary repertoire is contributed by 
5 the heavy chain. One source of light chain diversity can be 
from somatic mutation. Because not all light chains will be 
equivalent with respect to their ability to combine with a 
variety of different heavy chains, different strains of mice, 
each containing different light chain constructs can be 

10 generated and tested. The advantage of this scheme, as 

opposed to the use of unrearranged light chain miniloci, is 
the increased light chain allelic and isotypic exclusion that 
comes from having the light chain ready to pair with a heavy 
chain as soon as heavy chain VDJ joining occurs. This 

15 combination can result in an increased frequency of B-cells 

expressing fully human antibodies, and thus it can facilitate 
the isolation of human Ig expressing hybridomas. 

NotI inserts of plasmids pIGMl, pHCl, pIGGl, pKCl, 
and pKC2 were isolated away from vector sequences by agarose 

20 gel electrophoresis. The purified inserts were microinjected 
into the pronuclei of fertilized (C57BL/6 x CBA)F2 mouse 
embryos and transferred the surviving embryos into 
pseudopregnant females as described by Hogan et al. (Hogan et 
al., Methods of Manipulating the Mouse Embryo , Cold Spring 

25 Harbor Laboratory, New York (1986)). 

EXAMPLE 9 

Inactivation of the Mouse Kappa Light Chain Gene bv Homologous 
Recombination 

30 

This example describes the inactivation of the mouse 
endogenous kappa locus by homologous recombination in 
embryonic stem (ES) cells followed by introduction of the 
mutated gene into the mouse germ line by injection of targeted 
35 ES cells bearing an inactivated kappa allele into early mouse 
embryos (blastocysts) . 

The strategy is to delete J K and C K by homologous 
recombination with a vector containing DNA sequences 
homologous to the mouse kappa locus in which a 4.5 kb segment 
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of the locus, spanning the J K gene and C K segments, is deleted 
and replaced by the selectable marker neo. 

Construction of the kappa targeting vector 

The plasmid pGEM7 (KJ1) contains the neomycin 
resistance gene (neo) , used for drug selection of transfected 
ES cells, under the transcriptional control of the mouse 
phosphoglycerate kinase (pgk) promoter (Xbal/TaqI fragment; 
Adra et al. (1987) Gene 60: 65) in the cloning vector pGEM- 
7Zf(+). The plasmid also includes a heterologous 
polyadenylation site for the neo gene, derived from the 3« 
region of the mouse pgk gene (PvuII/Hindlll fragment; Boer et 
al., Biochemical Genetics , 28:299-308 (1990)). This plasmid 
was used as the starting point for construction of the kappa 
targeting vector. The first step was to insert sequences 
homologous to the kappa locus 3 • of the neo expression 
cassette. 

Mouse kappa chain sequences (Fig. 2 0a) were isolated 
from a genomic phage library derived from liver DNA using 
oligonucleotide probes specific for the Ck locus: 

5'- GGC TGA TGC TGC ACC AAC TGT ATC CAT CTT CCC ACC ATC CAG 
-3 1 

and for the Jk5 gene segment: 

5*- CTC ACG TTC GGT GCT GGG ACC AAG CTG GAG CTG AAA CGT AAG - 
3 ' . 

An 8 kb Bglll/SacI fragment extending 3 f of the 
mouse C K segment was isolated from a positive phage clone in 
two pieces, as a 1.2 kb Bglll/SacI fragment and a 6.8 kb Sad 
fragment, and subcloned into Bglll/SacI digested pGEM7 (KJ1) 
to generate the plasmid pNEO-K3 • (Fig. 20b). 

A 1.2 kb EcoRI/SphI fragment extending 5 f of the J K 
region was also isolated from a positive phage clone. An 
Sphl/Xbal/Bglll/EcoRI adaptor was ligated to the SphI site of 
this fragment, and the resulting EcoRI fragment was ligated 
into EcoRI digested pNE0-K3 ' , in the same 5 1 to 3 1 orientation 



121 

as the neo gene and the downstream 3' kappa sequences, to 
generate pNE0-K5 1 3 ' (Fig- 20c). 

The Herpes Simplex Virus (HSV) thymidine kinase (TK) 
gene was then included in the construct in order to allow for 
enrichment of ES clones bearing homologous recombinants, as 
described by Mansour et al., Nature 336 :348-352 (1988), which 
is incorporated herein by reference. The HSV TK cassette was 
obtained from the plasmid pGEM7 (TK) , which contains the 
structural sequences for the HSV TK gene bracketed by the 
mouse pgk promoter and polyadenylation sequences as described 
above for pGEM7 (KJ1) . . The EcoRI site of pGEM7 (TK) was 
modified to a BamHI site and the TK cassette was then excised 
as a BamHI/Hindlll fragment and subclohed into pGPlb to 
generate pGPlb-TK. This plasmid was linearized at the Xhol 
site and the Xhol fragment from pNEO-K5 1 3 1 , containing the neo 
gene flanked by genomic sequences from 5' of Jk and 3' of Ck, 
was inserted into pGPlb-TK to generate the targeting vector 
J/C KI (Fig. 2 0d) . The putative structure of the genomic 
kappa locus following homologous recombination with J/C Kl is 
shown in Fig. 20e. 

Generation and analysis of ES cells wi th targeted inactivation 
of a kappa allele 

The ES cells used were the AB-1 line grown on 
mitotically inactive SNL76/7 cell feeder layers (McMahon and 
Bradley, Cell 62.: 1073-1085 (1990)) essentially as described 
(Robertson, E.J. (1987) in Teratocarcinomas and Embryonic Stem 
Cells: A Practical Approach . E.J. Robertson, ed. (Oxford: IRL 
Press), p. 71-112). Other suitable ES lines include, but are 
not limited to, the E14 line (Hooper et al. (1987) Nature 326: 
2 92-2 95) , the D3 line (Doetschman et al. (1985) J. Embrvol. 
Exp. Morph. 87: 27-45) , and the CCE line (Robertson et al. 
(1986) Nature 323 : 445-448). The success of generating a 
mouse line from ES cells bearing a specific targeted mutation 
depends on the pluripotence of the ES cells (i.e., their 
ability, once injected into a host blastocyst, to participate 
in embryogenesis and contribute to the germ cells of the 
resulting animal) . 



122 

The pluripotence of any given ES cell line can vary 
with time in culture and the care with which it has been 
handled* The only definitive assay for pluripotence is to 
determine whether the specific population of ES cells to be 
used for targeting can give rise to chimeras capable of 
germline transmission of the ES genome* For this reason, 
prior to gene targeting, a portion of the parental population 
of AB-1 cells is injected into C57B1/6J blastocysts to 
ascertain whether the cells are capable of generating chimeric 
mice with extensive ES cell contribution and whether the 
majority of these chimeras can transmit the ES genome to 
progeny . 

The kappa chain inactivation vector J/C Kl was 
digested with NotI and electroporated into AB-1 cells by the 
methods described (Hasty et al., Nature , 350 :243-246 (1991)). 
Electroporated cells were plated onto 100 mm dishes at a 
density of 1-2 x 10 6 cells/dish. After 24 hours, G418 
(2 00jug/ial of active component) and FIAU (O.SjLiM) were added to 
the medium, and drug-resistant clones were allowed to develop 
over 10-11 days. Clones were picked, trypsinized, divided 
into two portions, and further expanded. Half of the cells 
derived from each clone were then frozen and the other half 
analyzed for homologous recombination between vector and 
target sequences . 

DNA analysis was carried out by Southern blot 
hybridization. DNA was isolated from the clones as described 
(Laird et al. , Nucl. Acids Res. 19:4293 (1991)) digested with 
Xbal and probed with the 800 bp EcoRI/Xbal fragment indicated 
in Fig. 20e as probe A. This probe detects a 3.7 kb Xbal 
fragment in the wild type locus, and a diagnostic 1.8 kb band 
in a locus which has homologously recombined with the 
targeting vector (see Fig. 2 0a and e) . Of 901 G4 18 and FIAU 
resistant clones screened by Southern blot analysis, 7 
displayed the 1.8 kb Xbal band indicative of a homologous 
recombination into one of the kappa genes. These 7 clones 
were further digested with the enzymes Bglll, SacI, and PstI 
to verify that the vector integrated homologously into one of 
the kappa genes. When probed with the diagnostic 800 bp 
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EcoRI/Xbal fragment (probe A), Bglll, Sad, and PstI digests 
of wild type DNA produce fragments of 4.1, 5.4, and 7 kb, 
respectively, whereas the presence of a targeted kappa allele 
would be indicated by fragments of 2.4, 7.5, and 5.7 kb, 
respectively (see Fig. 20a and e) . All 7 positive clones 
detected by the Xbal digest showed the expected Bglll, SacI, 
and PstI restriction fragments diagnostic of a homologous 
recombination at the kappa light chain. In addition, Southern 
blot analysis of an Nsil digest of the targeted clones using a 
neo specific probe (probe B, Fig. 2 0e) generated only the 
predicted fragment of 4.2 kb, demonstrating that the clones 
each contained only a single copy of the targeting vector. 

Generation of mice bearing the inactivated kappa chain 

Five of the targeted ES clones described in the 
previous section were thawed and injected into C57B1/6J 
blastocysts as described (Bradley, A. (1987) in 
Teratocarcinomas and Embryonic Stem Cells: A P ractical 
Approach . E.J. Robertson, ed. (Oxford: IRL Press) , p. 113-151) 
and transferred into the uteri of pseudopregnant females to 
generate chimeric mice resulting from a mixture of cells 
derived from the input ES cells and the host blastocyst. The 
extent of ES cell contribution to the chimeras can be visually 
estimated by the amount of agouti coat coloration, derived 
from the ES cell line, on the black C57B1/6J background. 
Approximately half of the offspring resulting from blastocyst 
injection of the targeted clones were chimeric (i.e., showed 
agouti as well as black pigmentation) and of these, the 
majority showed extensive (70 percent or greater) ES cell 
contribution to coat pigmentation. The AB1 ES cells are an XY 
cell line and a majority of these high percentage chimeras 
were male due to sex conversion of female embryos colonized by 
male ES cells. Male chimeras derived from 4 of the 5 targeted 
clones were bred with C57BL/6J females and the offspring 
monitored for the presence of the dominant agouti coat color 
indicative of germline transmission of the ES genome. 
Chimeras from two of these clones consistently generated 
agouti offspring. Since only one copy of the kappa locus was 
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targeted in the injected ES clones, each agouti pup had a 50 
percent chance of inheriting the mutated locus. Screening for 
the targeted gene was carried out by Southern blot analysis of 
Bgl II-digested DNA from tail biopsies, using the probe 
5 utilized in identifying targeted ES clones (probe A, Fig. 
20e) . As expected, approximately 50 percent of the agouti 
offspring showed a hybridizing Bgl II band of 2.4 kb in 
addition to the wild-type band of 4.1 kb, demonstrating the 
germline transmission of the targeted kappa locus. 

10 In order to generate mice homozygous for the 

mutation, heterozygotes were bred together and the kappa 
genotype of the offspring determined as described above. As 
expected, three genotypes were derived from the heterozygote 
matings: wild-type mice bearing two copies of a normal kappa 

15 locus, heterozygotes carrying one targeted copy of the kappa 
gene and one NT kappa gene, and mice homozygous for the kappa 
mutation. The deletion of kappa sequences from these latter 
mice was verified by hybridization of the Southern blots with 
a probe specific for J K (probe C, Fig. 2 0a) . Whereas 

2 0 hybridization of the J K probe was observed to DNA samples from 

heterozygous and wild-type siblings, no hybridizing signal was 
present in the homozygotes, attesting to the generation of a 
novel mouse strain in which both copies of the kappa locus 
have been inactivated by deletion as a result of targeted 
25 mutation. 

EXAMPLE 10 

Inactivation of the Mouse Heavy Chain Gene by Homologous 
Recombination 

3 0 This example describes the inactivation of the 

endogenous murine immunoglobulin heavy chain locus by 
homologous recombination in embryonic stem (ES) cells. The 
strategy is to delete the endogenous heavy chain J segments by 
homologous recombination with a vector containing heavy chain 
3 5 sequences from which the J H region has been deleted and 
replaced by the gene for the selectable marker neo. 

Construction of a heavy chain targeting vector 
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Mouse heavy chain sequences containing the J H region 
(Fig. 21a) were isolated from a genomic phage library derived 
from the D3 ES cell line (Gossler et al. , Proc. Natl. Acad. 
Sci. U.S.A. 83:9065-9069 (1986)) using a J H 4 specific 
oligonucleotide probe: 

5 f - ACT ATG CTA TGG ACT ACT GGG GTC AAG GAA CCT CAG TCA CCG 
-3 ■ 

A 3.5 kb genomic Sacl/StuI fragment, spanning the J H 
region, was isolated from a positive phage clone and subcloned 
into Sacl/Smal digested pUC18. The resulting plasmid was 
designated pUC18 J H . The neomycin resistance gene (neo) , used 
for drug selection of transfected ES cells, was derived from a 
repaired version of the plasmid pGEM7 (KJ1) . A report in the 
literature (Yenofsky et al* (1990) Proc . Na tl . Acad . Sci . 
(U.S.A. ) 87: 3435-3439) documents a point mutation the neo 
coding sequences of several commonly used expression vectors, 
including the construct pMClneo (Thomas and Cappechi (1987) 
Cell 51 ; 503-512) which served as the source of the neo gene 
used in pGEM7 (KJ1) . This mutation reduces the activity of 
the neo gene product and was repaired by replacing a 
restriction fragment encompassing the mutation with the 
corresponding sequence from a wild-type neo clone. The 
Hindlll site in the prepared pGEM7 (KJ1) was converted to a 
Sail site by addition of a synthetic adaptor, and the neo 
expression cassette excised by digestion with Xbal/Sall. The 
ends of the neo fragment were then blunted by treatment with 
the Klenow form of DNA poll, and the neo fragment was 
subcloned into the Nael site of pUC18 J H , generating the 
plasmid pUC18 J H -neo (Fig. 21b) . 

Further construction of the targeting vector was 
carried out in a derivative of the plasmid pGPlb. pGPlb was 
digested with the restriction enzyme NotI and ligated with the 
following oligonucleotide as an adaptor: 

5»- GGC CGC TCG ACG ATA GCC TCG AGG CTA TAA ATC TAG AAG AAT 
TCC AGC AAA GCT TTG GC -3 1 
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The resulting plasmid, called pGMT, was used to 
build the mouse immunoglobulin heavy chain targeting 
construct . 

The Herpes Simplex Virus (HSV) thymidine kinase (TK) 
gene was included in the construct in order to allow for 
enrichment of ES clones bearing homologous recombinants, as 
described by Mansour et al. (Nature 336 . 348-352 (1988)). The 
HSV TK gene was obtained from the plasmid pGEM7 (TK) by 
digestion with EcoRI and Hindlll. The TK DNA fragment was 
subcloned between the EcoRI and Hindlll sites of pGMT , 
creating the plasmid pGMT-TK (Fig. 21c) . 

To provide an extensive region of homology to the 
target sequence, a 5.9 kb genomic Xbal/Xhol fragment, situated 
5* of the J H region, was derived from a positive genomic phage 
clone by limit digestion of the DNA with Xhol, and partial 
digestion with Xbal. As noted in Fig. 21a, this Xbal site is 
not present in genomic DNA, but is rather derived from phage 
sequences immediately flanking the cloned genomic heavy chain 
insert in the positive phage clone. The fragment was 
subcloned into Xbal/Xhol digested pGMT-TK, to generate the 
plasmid pGMT— TK— J H 5 1 (Fig. 21d) . 

The final step in the construction involved the 
excision from pUC18 J H -neo of the 2.8 kb EcoRI fragment which 
contained the neo gene and flanking genomic sequences 3 1 of 
J H . This fragment was blunted by Klenow polymerase and 
subcloned into the similarly blunted Xhol site of 
pGMT-TK- J H 5 1 . The resulting construct, J H K01 (Fig. 21e) , 
contains 6.9 kb of genomic sequences flanking the J H locus, 
with a 2.3 kb deletion spanning the J H region into which has 
been inserted the neo gene. Fig. 2 If shows the structure of 
an endogenous heavy chain gene after homologous recombination 
with the targeting construct. 

EXAMPLE 11 

Generation and analysis of targeted ES cells 

AB-1 ES cells (McMahon and Bradley, Cell 
62: 1073-1085 (1990)) were grown on mitotically inactive 
SNL76/7 cell feeder layers essentially as described 
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(Robertson, E.J* (1987) Teratocarcinomas and Embryonic Stem 
Cells: A Practical Approach , E.J- Robertson, ed. (Oxford: IRL 
Press) , pp. 71-112) . As described in the previous example, 
prior to electroporation of ES cells with the targeting 
construct J H K01, the pluripotency of the ES cells was 
determined by generation of AB-1 derived chimeras which were 
shown capable of germline transmission of the ES genome. 

The heavy chain inactivation vector J H K01 was 
digested with NotI and electroporated into AB-1 cells by the 
methods described (Hasty et al. , Nature 350 :243-246 (1991)). 
Electroporated cells were plated into 100 mm dishes at a 
density of 1-2 x 10 6 cells/dish. After 24 hours, G418 
(200mg/ml of active component) and FIAU (0.5mM) were added to 
the medium, and drug-resistant clones were allowed to develop 
over 8-10 days. Clones were picked, trypsinized, divided into 
two portions, and further expanded. Half of the cells derived 
from each clone were then frozen and the other half analyzed 
for homologous recombination between vector and target 
sequences . 

DNA analysis was carried out by Southern blot 
hybridization. DNA was isolated from the clones as described 
(Laird et al. (1991) Nucleic Acids Res. 19: 4293), digested 
with StuI and probed with the 500 bp EcoRI/StuI fragment 
designated as probe A in Fig. 21f . This probe detects a StuI 
fragment of 4.7 kb in the wild-type locus, whereas a 3 kb band 
is diagnostic of homologous recombination of endogenous 
sequences with the targeting vector (see Fig. 21a and f ) . Of 
525 G418 and FIAU doubly-resistant clones screened by Southern 
blot hybridization, 12 were found to contain the 3 kb fragment 
diagnostic of recombination with the targeting vector. That 
these clones represent the expected targeted events at the J H 
locus (as shown in Fig. 21f) was confirmed by further 
digestion with Hindlll, Spel and Hpal. Hybridization of probe 
A (see Fig. 21f) to Southern blots of Hindlll, Spel, and Hpal 
digested DNA produces bands of 2.3 kb, >10 kb, and >10kb, 
respectively, for the wild-type locus (see Fig. 21a) , whereas 
bands of 5.3 kb, 3.8 kb, and 1.9 kb, respectively, are 
expected for the targeted heavy chain locus (see Fig 2 If ) . 
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All 12 positive clones detected by the StuI digest showed the 
predicted Hindlll, Spel, and Hpal bands diagnostic of a 
targeted J H gene. In addition, Southern blot analysis of a 
StuI digest of all 12 clones using a neo-specific probe (probe 
B, Fig- 21f) generated only the predicted fragment of 3 kb f 
demonstrating that the clones each contained only a single 
copy of the targeting vector. 

Generation of mice carrying the J H deletion 

Three of the targeted ES clones described in the 
previous section were thawed and injected into C57BL/6 J 
blastocysts as described (Bradley, A. (1987) in 
Teratocarcinomas and Embryonic Stem Cells: A Pra ctical 
Approach . E.J. Robertson, ed. (Oxford: IRL Press), p. 113-151) 
and transferred into the uteri of pseudopregnant females. The 
extent of ES cell contribution to the chimera was visually 
estimated from the amount of agouti coat coloration, derived 
from the ES cell line, on the black C57BL/6J background. Half 
of the offspring resulting from blastocyst injection of two of 
the targeted clones were chimeric (i.e., showed agouti as well 
as black pigmentation) ; the third targeted clone did not 
generate any chimeric animals. The majority of the chimeras 
showed significant (approximately 50 percent or greater) ES 
cell contribution to coat pigmentation. Since the AB-1 ES 
cells are an XY cell line, most of the chimeras were male, due 
to sex conversion of female embryos colonized by male ES 
cells. Males chimeras were bred with C57BL/6J females and the 
offspring monitored for the presence of the dominant agouti 
coat color indicative of germline transmission of the ES 
genome. Chimeras from both of the clones consistently 
generated agouti offspring. Since only one copy of the heavy 
chain locus was targeted in the injected ES clones, each 
agouti pup had a 50 percent chance of inheriting the mutated 
locus. Screening for the targeted gene was carried out by 
Southern blot analysis of Stul-digested DNA from tail 
biopsies, using the probe utilized in identifying targeted ES 
clones (probe A, Fig. 2 If ) . As expected, approximately 50 
percent of the agouti offspring showed a hybridizing StuI band 
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of approximately 3 kb in addition to the wild-type band of 4.7 
3cb, demonstrating germline transmission of the targeted J H 
gene segment* 

In order to generate mice homozygous for the 
mutation, heterozygotes were bred together and the heavy chain 
genotype of the offspring determined as described above. As 
expected, three genotypes were derived from the heterozygote 
matings: wild-type mice bearing two copies of the normal J H 
locus, heterozygotes carrying one targeted copy of the gene 
and one normal copy, and mice homozygous for the J H mutation. 
The absence of J H sequences from these latter mice was 
verified by hybridization of the Southern blots of Stul- 
digested DNA with a probe specific for J H (probe C, Fig. 21a) . 
Whereas hybridization of the J H probe to a 4 . 7 kb fragment in 
DNA samples from heterozygous and wild-type siblings was 
observed, no signal was present in samples from the J H -mutant 
homozygotes, attesting to the generation of a novel mouse 
strain in which both copies of the heavy chain gene have been 
mutated by deletion of the J H sequences. 

EXAMPLE 12 
Heavy Chain Minilocus Transaene 

A. Construction of plasmid vectors for cloning large DNA 

sequences 

1. PGP la 

The plasmid pBR3 22 was digested with EcoRI and Sty I 
and ligated with the following oligonucleotides: 

oligo-42 5'- caa gag ccc gcc taa tga gcg ggc ttt ttt ttg cat 
act gcg gcc get -3 1 

oligo-43 5'- aat tag egg ccg cag tat gca aaa aaa age ccg etc 
att agg egg get -3 1 

The resulting plasmid, pGPla, is designed for 
cloning very large DNA constructs that can be excised by the 
rare cutting restriction enzyme Notl. It contains a NotI 
restriction site downstream (relative to the ampicillin 
resistance gene, AmpR) of a strong transcription termination 
signal derived from the trpA gene (Christie et al., Proc. 



130 

Natl. Acad. Sci. USA 78.:4180 (1981)). This termination signal 
reduces the potential toxicity of coding sequences inserted 
into the NotI site by eliminating readthrough transcription 
from the AmpR gene. In addition, this plasmid is low copy 
relative to the pUC plasmids because it retains the pBR3 2 2 
copy number control region. The low copy number further 
reduces the potential toxicity of insert sequences and reduces 
the selection against large inserts due to DNA replication. 
The vectors pGPlb, pGPlc, pGPld, and pGPlf are derived from 
pGPla and contain different polylinker cloning sites. The 
poly linker sequences are given below 

pGPla 

NotI 
GCGGCCGC 

pGPlb 

NotI Xhol Clal BamHI Hindlll NotI 

GCggccgcctcgagatcactatcgattaattaaggatccagcagtaagcttgcGGCCGC 

pGIlc 

NotI Smal Xhol Sail Hindlll BamHI SacII NotI 
GCggccgcatcccgggtctcgaggtcgacaagctttcgaggatccgcGGCCGC 

pGPld 

NotI Sail Hindlll Clal BamHI Xhol NotI 
GCggccgctgtcgacaagcttatcgatggatcctcgagtgcGGCCGC 

pGPlf 

NotI Sail Hindlll EcoRI Clal Kpnl BamHI Xhol NotI 

GCggccgctgtcgacaagcttcgaattcagatcgatgtggtacctggatcctcgagtgcGGCCGC 

Each of these plasmids can be used for the construction of 
large transgene inserts that are excisable with NotI so that 
the transgene DNA can be purified away from vector sequences 
prior to microinjection. 



2 . pGPlb 



pGPla was digested with NotI and ligated with the 
following oligonucleotides : 



oligo-47 5 1 - ggc cgc aag ctt act get gga tec tta att aat cga 
tag tga tct cga ggc -3 1 

oligo-48 5 f - ggc cgc etc gag ate act ate gat taa tta agg ate 
cag cag taa get tgc -3 1 

The resulting plasmid, pGPlb, contains a short 
polylinker region flanked by NotI sites. This facilitates the 
construction of large inserts that can be excised by NotI 
digestion. 

3 . pGPe 

The following oligonucleotides: 

oligo-44 5 1 - etc cag gat cca gat ate agt ace tga aac agg get 
tgc -3 1 

oligo-45 5 1 - etc gag cat gca cag gac ctg gag cac aca cag cct 
tec -3 • 

were used to amplify the immunoglobulin heavy chain 3 1 
enhancer (S. Petterson, et al., Nature 344 : 165-168 (1990)) 
from rat liver DNA by the polymerase chain reaction technique. 

The amplified product was digested with BamHI and 
SphI and cloned into BamHI/SphI digested pNN03 (pNN03 is a pUC 
derived plasmid that contains a polylinker with the following 
restriction sites, listed in order: NotI, BamHI, Ncol , Clal, 
EcoRV, Xbal, SacI, Xhol, SphI, PstI , Bglll, EcoRI, Smal, Kpnl, 
Hindlll, and NotI). The resulting plasmid, pRE3 , was digested 
with BamHI and Hindlll, and the insert containing the rat Ig 
heavy chain 3' enhancer cloned into BamHI/Hindlll digested 
pGPlb. The resulting plasmid, pGPe (Fig. 22 and Table 1) , 
contains several unique restriction sites into which sequences 
can be cloned and subsequently excised together with the 3 ' 
enhancer by NotI digestion. 
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Sequence of ve ctor pGFe. 
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B. Construction of IaM expressing minilocus transgene, pIGMl 

1. Isolation of J-u constant region clones and construction 
of pJMl 

A human placental genomic DNA library cloned into 
the phage vector XEMBL3 /SP6/T7 (Clonetech Laboratories, Inc., 
Palo Alto, CA) was screened with the human heavy chain J 
region specific oligonucleotide: 

oligo-1 5 1 - gga ctg tgt ccc tgt gtg atg ctt ttg atg tct ggg 
gcc aag -3 1 

and the phage clone Xl.3 isolated* A 6 kb Hindlll/Kpnl 
fragment from this clone, containing all six J segments as 
well as D segment DHQ52 and the heavy chain J-jLt intronic 
enhancer, was isolated* The same library was screened with 
the human \i specific oligonucleotide: 

oligo-2 5'- cac caa gtt gac ctg cct ggt cac aga cct gac cac 
eta tga -3 ' 

and the phage clone \2.1 isolated. A 10.5 kb Hindlll/Xhol 
fragment, containing the \l switch region and all of the /z 
constant region exons, was isolated from this clone. These 
two fragments were ligated together with KpnI/XhoI digested 
pNN03 to obtain the plasmid pJMl. 

2 . pJM2 

A 4 kb Xhol fragment was isolated from phage clone 
\2 . 1 that contains sequences immediately downstream of the 
sequences in pJMl, including the so called Em element involved 
in ^-associated deleteon of the /x in certain IgD expressing 
B-cells (Yasui et al., Eur. J. Immunol . 19:1399 (1989), which 
is incorporated herein by reference) . This fragment was 
treated with the Klenow fragment of DNA polymerase I and 
ligated to Xhol cut, Klenow treated, pJMl. The resulting 
plasmid, pJM2 (Fig. 23), had lost the internal Xhol site but 
retained the 3 1 Xhol site due to incomplete reaction by the 
Klenow enzyme. pJM2 contains the entire human J region, the 
heavy chain J-/x intronic enhancer, the \l switch region and all 



of the p constant, region exons, as well as the two 0.4 kb 
direct repeats, a\i and Eju, involved in 6-associated deletion 
of the jLt gene* 

3 . Isolation of D region clones and construction of pDHl 

The following human D region specific 
oligonucleotide: 

oligo-4 5'- tgg tat tac tat ggt teg ggg agt tat tat aac cac 
agt gtc -3 1 

was used to screen the human placenta genomic library for D 
region clones. Phage clones X4 . 1 and X4 . 3 were isolated. A 
5.5 kb Xhol fragment, that includes the D elements D K1 , D N1/ 
and D M2 (Ichihara et al. , EMBQ J . 7:4141 (1988)), was isolated 
from phage clone \4 . 1 . An adjacent upstream 5.2 kb Xhol 
fragment, that includes the D elements D LR1 , D xpl , D X p. x , and 
D A1 , was isolated from phage clone \4 . 3 . Each of these D 
region Xhol fragments were cloned into the Sail site of the 
plasmid vector pSP72 (Promega, Madison, Wl) so as to destroy 
the Xhol site linking the two sequences. The upstream 
fragment was then excised with Xhol and Smal, and the 
downstream fragment with EcoRV and Xhol. The resulting 
isolated fragments were ligated together with Sail digested 
pSP72 to give the plasmid pDHl. pDHl contains a 10.6 kb 
insert that includes at least 7 D segments and can be excised 
with Xhol (5») and EcoRV (3»)* 

4 . pCORl 

The plasmid pJM2 was digested with Asp718 (an 
isoschizomer of Kpnl) and the overhang failed in with the 
Klenow fragment of DNA polymerase I. The resulting DNA was 
then digested with Clal and the insert isolated. This insert 
was ligated to the Xhol /EcoRV insert of pDHl and Xhol /Clal 
digested pGPe to generate pCORl (Fig. 24) . 

5. PVH251 

A 10.3 kb genomic Hindlll fragment containing the 
two human heavy chain variable region segments V H 251 and V H 105 
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(Humphries et al., Nature 331 :446 (1988), which is 
incorporated herein by reference) was subcloned into pSP72 to 
give the plasmid pVH2 51. 

6. pIGMl 

The plasmid pCORl was partially digested with Xhol 
and the isolated Xhol /Sail insert of pVH251 cloned into the 
upstream Xhol site to generate the plasmid pIGMl (Fig. 25) . 
pIGMl contains 2 functional human variable region segments, at 
least 8 human D segments all 6 human J H segments, the human 
J-fM enhancer, the human a\i element, the human /x switch region, 
all of the human /x coding exons, and the human S/i element, 
together with the rat heavy chain 3 1 enhancer, such that all 
of these sequence elements can be isolated on a single 
fragment, away from vector sequences, by digestion with NotI 
and microinjected into mouse embryo pronuclei to generate 
transgenic animals . 

C. Construction of laM and IaG expressing minilocus 
transaene . pHCl 

1. Isolation of 7 constant region clones 

The following oligonucleotide, specific for human Ig 
g constant region genes: 

oligo-29 5'- cag cag gtg cac acc caa tgc cca tga gcc cag aca 
ctg gac -3 f 

was used to screen the human genomic library. Phage clones 
12 9.4 and \29.5 were isolated. A 4 kb Hindlll fragment of 
phage clone X29.4, containing a 7 switch region, was used to 
probe a human placenta genomic DNA library cloned into the 
phage vector lambda FIX™ II (Stratagene, La Jolla, CA) . Phage 
clone XSgl.13 was isolated. To determine the subclass of the 
different 7 clones, dideoxy sequencing reactions were carried 
out using subclones of each of the three phage clones as 
templates and the following oligonucleotide as a primer: 

oligo-67 5 f - tga gcc cag aca ctg gac -3' 
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Phage clones \29.5 and \S7l.i3 were both determined 
to be of the 71 subclass. 

2 . pyel 

A 7.8 kb Hindlll fragment of phage clone \2 9.5, 
containing the 71 coding region was cloned into pUCIS . The 
resulting plasmid, pLTl, was digested with Xhol, Klenow 
treated, and religated to destroy the internal Xhol site. The 
resulting clone, pLTlxk, was digested with Hindlll and the 
insert isolated and cloned into pSP72 to generate the plasmid 
clone pLTlxks. Digestion of pLTlxks at a polylinker Xhol site 
and a human sequence derived BamHI site generates a 7.6 kb 
fragment containing the 71 constant region coding exons. This 
7.6 kb XhoI/BamHI fragment was cloned together with an 
adjacent downstream 4 . 5 kb BamHI fragment from phage clone 
X29.5 into XhoI/BamHI digested pGPe to generate the plasmid 
clone P7el. P7el contains all of the 71 constant region 
coding exons, together with 5 kb of downstream sequences, 
linked to the rat heavy chain 3 1 enhancer. 

3. P7^2 

A 5.3 kb Hindlll fragment containing the 7I switch 
region and the first exon of the pre-switch sterile transcript 
(P. Sideras et al. (1989) International Immunol. 1 , 631) was 
isolated from phage clone XS7I.13 and cloned into pSP72 with 
the polylinker Xhol site adjacent to the 5 1 end of the insert, 
to generate the plasmid clone PS7IS. The Xhol /Sail insert of 
PS7IS was cloned into Xhol digested P7el to generate the 
plasmid clone P7e2 (Fig. 2 6) . P7e2 contains all of the 7I 
constant region coding exons, and the upstream switch region 
and sterile transcript exons, together with 5 kb of downstream 
sequences, linked to the rat heavy chain 3' enhancer. This 
clone contains a unique Xhol site at the 5* end of the insert. 
The entire insert, together with the Xhol site and the 3' rat 
enhancer can be excised from vector sequences by digestion 
with Notl. 
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4 . pHCl 

The plasmid pIGMl was digested with Xhol and the 4 3 
kb insert isolated and cloned into Xhol digested pge2 to 
generate the plasmid pHCl (Fig. 25) . pHCl contains 2 
5 functional human variable region segments, at least 8 human D 
segments all 6 human *T H segments, the human J-j± enhancer, the 
human afi element, the human /i switch region, all of the human 
/x coding exons, the human Eju element, and the human 7I 
constant region, including the associated switch region and 
10 sterile transcript associated exons, together with the rat 
heavy chain 3 1 enhancer, such that all of these sequence 
elements can be isolated on a single fragment, away from 
vector sequences, by digestion with NotI and mieroinjected 
into mouse embryo pronuclei to generate transgenic animals. 

15 

D. Construction of IaM and laG expressing minilocus 
transaene , pHC2 

1. Isolation of human heavy chain V region gene VH49.8 

The human placental genomic DNA library lambda, FIX™ 
2 0 II, Stratagene, La Jolla, CA) was screened with the following 
human VH1 family specific oligonucleotide: 

oligo-49 5 f - gtt aaa gag gat ttt att cac ccc tgt gtc etc tec 
aca ggt gtc -3 1 

25 

Phage clone \49.8 was isolated and a 6.1 kb Xbal 
fragment containing the variable segment VH49.8 subcloned into 
pNN03 (such that the polylinker Clal site is downstream of 
30 VH49.8 and the polylinker Xhol site is upstream) to generate 
the plasmid pVH49.8. An 800 bp region of this insert was 
sequenced, and VH49.8 found to have an open reading frame and 
intact splicing and recombination signals, thus indicating 
that the gene is functional (Table 2) . 



\2>% 



TABLE 2 



„ w _-i rart*arara-^-* 



IS) 



jjEififi!I3Li-n. va i ^r— - ; - 



250 



La cccccrzzzz 300 
-zTr^Arc?^ sLsuPiisvaJ. VaJ^laAlsA 

aai—saag cn==g=ssgg rs.-==r=g=z ^sncz^saqa -au 

-a/ ~~«-» — aaGIGICZSG > ^ , Aia-r j.ww AGCIGSIGS. 400 

~~ w ~ - GLyValGln SerGlnVaiG I nTpnV alGI 

i j[ i — . c3fZ77ZSJ& CCTSZSfflG ^I^J-IU^. 450 

nSerdvALa. GluValLvsL vsPrcGlySe rSerVaiXys ValSerC ysL 

vu, assszsro czarcaGzis gstsgsqg soo 

vsAlaierGL vGLvTfcrPte *>r*f> rsyi& lalleSerTr pVa l&rcGlii 

'l u-i r iaGSSZTIS^ GIG3KD3G3* AGSKTCSSH: -^isu 

AlaPraGLyG laSIyLsaSI u 2iL* te & GLy Ar g npTl pP a l l g^^ 

yUeaLaAsn TyzftlaGlcL ysfiheGHlSL yAraVa lZSr IlP ThrA la ft. 

floaaamc asscaocz tzoisss: TGasssrr osacirac 

spLvsSerCi rSerCsAla TviltetGl uL euSerSer Le uArcSerGlu 

AscOffiftl aV aUvrTvrC y sAlaArg 

rrrarMff fr ramaamr GBGSSSBG G3 te . ,j*i>. 

asbsogs uiT^AiTH fis tttmgszis -nacRRaKr Gssnanaa. 



SO 
600 
SO 
700 
750 
800 
312 



Sequence aLhunaa V„I fcmtijr gene V H 49.8 



139 



2. pV2 

A 4 kb Xbal genomic fragment containing the human 
V H IV family gene V R 4-21 (Sanz et al. , EMBO J . , 8:3741 (1989)), 
subcloned into the plasmid pUC12 , was excised with Smal and 
5 Hindlll, and treated with the Klenow fragment of polymerase I. 
The blunt ended fragment was then cloned into Clal digested, 
Klenow treated, pVH49.8. The resulting plasmid, pV2 , contains 
the human heavy chain gene VH49.8 linked upstream of VH4-21 in 
the same orientation, with a unique Sail site at the 3' end of 
10 the insert and a unique Xhol site at the 5 1 end. 

3. 0S7I-5 1 

A 0.7 kb Xbal/Hindlll fragment (representing 
sequences immediately upstream of, and adjacent to, the 5.3 kb 

15 71 switch region containing fragment in the plasmid P7e2) 

together with the neighboring upstream 3 . 1 kb Xbal fragment 
were isolated from the phage clone \Sgl.l3 and cloned into 
Hindlll/Xbal digested pUC18 vector. The resulting plasmid, 
PS7I-5 ' , contains a 3.8 kb insert representing sequences 

2 0 upstream of the initiation site of the sterile transcript 
found in B-cells prior to switching to the 71 isotype (P. 
Sideras et al., International Immunol. 1:631 (1989)). Because 
the transcript is implicated in the initiation of isotype 
switching, and upstream cis-acting sequences are often 

25 important for transcription regulation, these sequences are 

included in transgene constructs to promote correct expression 
of the sterile transcript and the associated switch 
recombination . 

30 4 . pVGEl 

The PS7I-5 1 insert was excised with Smal and 
Hindlll, treated with Klenow enzyme, and ligated with the 
following oligonucleotide linker: 

35 5 1 - ccg gtc gac egg -3 1 

The ligation product was digested with Sail and ligated to 
Sail digested pV2 . The resulting plasmid, pVP, contains 3.8 
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kb of 71 switch 5 f flanking sequences linked downstream of the 
two human variable gene segments VH49J8 and VH4-21 (see Table 
2) . The pVP insert is isolated by partial digestion with Sail 
and complete digestion with Xhol, followed by purification of 
the 15 kb fragment on an agarose gel. The insert is then 
cloned into the Xhol site of pye2 to generate the plasmid 
clone pVGEl (Fig* 27) . pVGEl contains two human heavy chain 
variable gene segments upstream of the human 71 constant gene 
and associated switch region. A unique Sail site between the 
variable and constant regions can be used to clone in D, J , 
and ju gene segments. The rat heavy chain 3' enhancer is 
linked to the 3 ' end of the 7I gene and the entire insert is 
flanked by NotI sites. 

5. PHC2 

The plasmid clone pVGEl is digested with Sail and 
the Xhol insert of pIGMl is cloned into it. The resulting 
clone, pHC2 (Fig. 25) , contains 4 functional human variable 
region segments, at least 8 human D segments all 6 human J H 
segments, the human J-m enhancer, the human a/x element, the 
human fi switch region, all of the human ji coding exons, the 
human S/x element, and the human 7I constant region, including 
the associated switch region and sterile transcript associated 
exons, together with 4 kb flanking sequences upstream of the 
sterile transcript initiation site. These human sequences are 
linked to the rat heavy chain 3* enhancer, such that all of 
the sequence elements can be isolated on a single fragment, 
away from vector sequences, by digestion with NotI and 
microinjected into mouse embryo pronuclei to generate 
transgenic animals. A unique Xhol site at the 5 1 end of the 
insert can be used to clone in additional human variable gene 
segments to further expand the recombinational diversity of 
this heavy chain minilocus. 

E. Transgenic mice 

The NotI inserts of plasmids pIGMl and pHCl were 
isolated from vector sequences by agarose gel electrophoresis. 
The purified inserts were microinjected into the pronuclei of 
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fertilized (C57BL/6 x CBA) F2 mouse embryos and transferred the 
surviving embryos into pseudopregnant females as described by 
Hogan et al. (B. Hogan, F. Costantini, and E. Lacy, Methods of 
Manipulating the Mouse Embryo, 1986, Cold Spring Harbor 
Laboratory, New York) . Mice that developed from injected 
embryos were analyzed for the presence of transgene sequences 
by Southern blot analysis of tail DNA. Transgene copy number 
was estimated by band intensity relative to control standards 
containing known quantities of cloned DNA. At 3 to 8 weeks of 
age, serum was isolated from these animals and assayed for the 
presence of transgene encoded human IgM and IgGl by ELISA as 
described by Harlow and Lane (E. Harlow and D. Lane. 
Antibodies: A Laboratory Manual, 1988, Cold Spring Harbor 
Laboratory, New York) . Microtiter plate wells were coated 
with mouse monoclonal antibodies specific for human IgM (clone 
AF6, #0285, AMAC, Inc. Westbrook, ME) and human IgGl (clone 
JL512, #0280, AMAC, Inc. Westbrook, ME). Serum samples were 
serially diluted into the wells and the presence of specific 
immunoglobulins detected with affinity isolated alkaline 
phosphatase conjugated goat anti-human Ig (polyvalent) that 
had been pre-adsorbed to minimize cross-reactivity with mouse 
immunoglobulins. Table 3 and Fig. 28 show the results of an 
ELISA assay for the presence of human IgM and IgGl in the 
serum of two animals that developed from embryos injected with 
the transgene insert of plasmid pHCl. All of the control non« 
transgenic mice tested negative for expression of human IgM 
and IgGl by this assay. Mice from two lines containing the 
pIGMl NotI insert (lines #6 and 15) express human IgM but not 
human IgGl. We tested mice from 6 lines that contain the pHCl 
insert and found that 4 of the lines (lines #2 6, 38, 57 and 
122) express both human IgM and human IgGl, while mice from 
two of the lines (lines #19 and 21) do not express detectable 
levels of human immunoglobulins. The pHCl transgenic mice 
that did not express human immunoglobulins were so-called G Q 
mice that developed directly from microinjected embryos and 
may have been mosaic for the presence of the transgene. 
Southern blot analysis indicates that many of these mice 
contain one or fewer copies of the transgene per cell. The 
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detection of human IgM in the serum of pIGMl transgenics, and 
human IgM and IgGl in pHCl transgenics, provides evidence that 
the transgene sequences function correctly in directing VDJ 
joining, transcription, and isotype switching. One of the 
animals (#18) was negative for the transgene by Southern blot 
analysis, and showed no detectable levels of human IgM or 
IgGl. The second animal (#38) contained approximately 5 
copies of the transgene, as assayed by Southern blotting, and 
showed detectable levels of both human IgM and IgGl. The 
results of ELISA assays for 11 animals that developed from 
transgene injected embryos is summarized in the table below 
(Table 3) . 
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TABLE 3 

Detection of human IgM and IgGl in the serum of transgenic 
animals by ELISA assay 

approximate 
in j ected transgene 
animal # transgene copies per cell human IgM human IgGl 



pIGMl 



+ + 



25 



30 



35 



40 



9 
10 
12 
15 
18 
19 
21 
26 
38 



pIGMl 

pIGMl 

pIGMl 

pIGMl 

pIGMl 

pHCl 

pHCl 

pHCl 

pHCl 

pHCl 



0 
0 
0 
0 
10 
0 

1 
<1 



+ + 



+ + 
+ + 



+ 
+ 
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Table 3 shows a correlation between the presence of 
integrated transgene DNA and the presence of transgene encoded 
immunoglobulins in the serum. Two of the animals that were 
found to contain the pHCl transgene did not express detectable 
levels of human immunoglobulins. These were both low copy 
animals and may not have contained complete copies of the 
transgenes, or the animals may have been genetic mosaics 
(indicated by the <1 copy per cell estimated for animal #21) , 
and the transgene containing cells may not have populated the 
hematopoietic lineage. Alternatively, the transgenes may have 
integrated into genomic locations that are not conducive to 
their expression. The detection of human IgM in the serum of 
pIGMl transgenics, and human IgM and IgGl in pHCl transgenics, 
indicates that the transgene sequences function correctly in 
directing VDJ joining, transcription, and isotype switching. 

F. cDNA clones 

To assess the functionality of the pHCl transgene in 
VDJ joining and class switching, as well the participation of 
the transgene encoded human B-cell receptor in B-cell 
development and allelic exclusion, the structure of 
immunoglobulin cDNA clones derived from transgenic mouse 
spleen mRNA were examined. The overall diversity of the 
transgene encoded heavy chains, focusing on D and J segment 
usage, N region addition, CDR3 length distribution, and the 
frequency of joints resulting in functional mRNA molecules was 
examined. Transcripts encoding IgM and IgG incorporating 
VH105 and VH251 were examined. 

Polyadenylated RNA was isolated from an eleven week 
old male second generation line-57 pHCl transgenic mouse. 
This RNA was used to synthesize oligo-dT primed single 
stranded cDNA. The resulting cDNA was then used as template 
for four individual PCR amplifications using the following 
four synthetic oligonucleotides as primers: VH251 specific 
oligo-149, eta get cga gtc caa gga gtc tgt gec gag gtg cag ctg 
(g,a,t,c); VH105 specific o-150, gtt get cga gtg aaa ggt gtc 
cag tgt gag gtg cag ctg (g,a,t,c); human gammal specific 
oligo-151, ggc get cga gtt cca cga cac cgt cac egg ttc; and 



144 

human mu specific oligo-152, cct get cga ggc age caa egg cca 
cgc tgc teg. Reaction 1 used primers 0-149 and o-151 to 
amplify VH251-gammal transcripts, reaction 2 used o-149 and o- 
152 to amplify VH251-mu transcripts, reaction 3 used o-150 and 
o-151 to amplify VH105-gammal transcripts, and reaction 4 used 
O-150 and o-152 to amplify VH105-mu transcripts. The 
resulting 0.5 kb PCR products were isolated from an agarose 
gel; the n transcript products were more abundant than the y 
transcript products, consistent with the corresponding ELISA 
data (Fig. 34) . The PCR products were digested with Xhol and 
cloned into the plasmid pNN03. Double-stranded plasmid DNA 
was isolated from minipreps of nine clones from each of the 
four PCR amplifications and dideoxy sequencing reactions were 
performed. Two of the clones turned out to be deletions 
containing no D or J segments. These could not have been 
derived from normal RNA splicing products and are likely to 
have originated from deletions introduced during PCR 
amplification. One of the DNA samples turned out to be a 
mixture of two individual clones, and three additional clones 
did not produce readable DNA sequence (presumably because the 
DNA samples were not clean enough) . The DNA sequences of the 
VDJ joints from the remaining 30 clones are compiled in Table 
4. Each of the sequences are unique, indicating that no 
single pathway of gene rearrangement, or single clone of 
transgene expressing B-cells is dominant. The fact that no 
two sequences are alike is also an indication of the large 
diversity of immunoglobulins that can be expressed from a 
compact minilocus containing only 2 V segments, 10 D segments, 
and 6 J segments. Both of the V segments, all six of the J 
segments, and 7 of the 10 D segments that are included in the 
transgene are used in VDJ joints. In addition, both constant 
region genes (mu and gammal) are incorporated into 
transcripts. The VH105 primer turned out not to be specific 
for VH105 in the reactions performed. Therefore many of the 
clones from reactions 3 and 4 contained VH251 transcripts. 
Additionally, clones isolated from ligated reaction 3 PCR 
product turned out to encode igM rather than IgG; however this 
may reflect contamination with PCR product from reaction 4 as 
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the DNA was isolated on the same gel. An analogous 
experiment, in which immunoglobulin heavy chain sequences were 
amplified from adult human peripheral blood lymphocytes (PBL) , 
and the DNA sequence of the VDJ joints determined, was 
recently reported by Yamada et al. ( J. Exp. Med . 173 : 395-407 
(1991) , which is incorporated herein by reference) . We 
compared the data from human PBL with our data from the pHCl 
transgenic mouse. 
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G. J segment choice 

Table 5 compared the distribution of J segments 
incorporated into pHCl transgene encoded transcripts to J 
segments found in adult human PBL immunoglobulin transcripts. 
The distribution profiles are very similar, J 4 is the dominant 
segment in both systems, followed by J6 . J2 is the least 
common segment in human PBL and the transgenic animal. 



TABLE 5 
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J. Segment 

Jl 
J2 
J3 
J4 
J5 
J6 



J. Segment Choice 

Percent Usage (± 3%) 
HC1 transgenic Human PBL 



7 
3 
17 
44 
3 
26 



100% 



1 
<1 

9 
53 
15 
22 



100^ 



H. D segment choice 

25 491 (40 of 82) of the clones analyzed by Yamada et 

al. incorporated D segments that are included in the pHCl 
transgene. An additional 11 clones contained sequences that 
were not assigned by the authors to any of the known D 
segments. Two of these 11 unassigned clones appear to be 

3 0 derived from an inversion of the DIR2 segments which is 

included in the pHCl construct. This mechanism, which was 
predicted by Ichihara et al. (EMBO_J. 7:4141 (1988)) and 
observed by Sanz f J. Immunol . 147 ; 1720-1729 (1991)), was not 
considered by Yamada et al. f J. Exp. Med . 173 :395-407 (1991)). 

3 5 Table 5 is a comparison of the D segment distribution for the 
pHCl transgenic mouse and that observed for human PBL 
transcripts by Yamada et al. The data of Yamada et al. was 
recompiled to include DIR2 use, and to exclude D segments that 
are not in the pHCl transgene. Table 6 demonstrates that the 

40 distribution of D segment incorporation is very similar in the 
transgenic mouse and in human PBL. The two dominant human D 
segments, DXP'l and DN1, are also found with high frequency in 
the transgenic mouse. The most dramatic dissimilarity between 
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the two distributions is the high frequency of DHQ52 in the 
transgenic mouse as compared to the human. The high frequency 
of DHQ52 is reminiscent of the D segment distribution in the 
human fetal liver. Sanz has observed that 14% of the heavy 
chain transcripts contained DHQ52 sequences. If D segments 
not found in pHCl are excluded from the analysis, 31% of the 
fetal transcripts analyzed by Sanz contain DHQ52. This is 
comparable to the 27% that we observe in the pHCl transgenic 
mouse. 

TABLE 6 D Segment Choice 

Percent Usage (± 3%) 
D. Segment HC1 transgenic Human PBL 

15 DLR1 <1 <i 

DXP1 3 6 

DXP'l 25 19 

DAI <1 12 

DK1 7 12 

2 0 DN1 12 2 2 

DIR2 7 4 

DM2 <1 2 

DLR2 3 4 

DHQ52 2 6 2 

25 ? 17 17 

100% 100% 

30 I. Functionality of VDJ joints 

Table 7 shows the predicted amino acid sequences of 
the VDJ regions from 3 0 clones that were analyzed from the 
pHCl transgenic. The translated sequences indicate that 2 3 of 
the 3 0 VDJ joints (77%) are in-frame with respect to the 

3 5 variable and J segments. 
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J. CDR3 length distribution 

Table 8 compared the length of the CDR3 peptides 
from transcripts with in-frame VDJ joints in the pHCl 
transgenic mouse to those in human PBL. Again the human PBL 
5 data comes from Yamada et al. The profiles are similar with 
the transgenic profile skewed slightly toward smaller CDR3 
peptides than observed from human PBL. The average length of 
CDR3 in the transgenic mouse is 10.3 amino acids. This is 
substantially the same as the average size reported for 
10 authentic human CDR3 peptides by Sanz ( J. Immunol . 147 : 1720- 
1729 (1991) ) . 

TABLE 8 CDR3 Length Distribution 

15 

Percent Occurrence (± 3%) 



25 



#amino acids in CDR3 


HC1 transgenic 


Human PBL 


3-8 


26 


14 


9-12 


48 


41 


13-18 


26 


37 


19-23 


<1 


7 


>23 


<1 


1 




100% 


100% 



EXAMPLE 13 
3 0 Rearranged Heavy Chain Transaenes 

A. Isolation of Rearranged Human Heavy Chain VDJ seg ments. 

Two human leukocyte genomic DNA libraries cloned 
into the phage vector XEMBL3/ SP6/T7 (Clonetech Laboratories, 
Inc., Palo Alto, CA) are screened with a 1 kb Pacl/Hindlll 
3 5 fragment of Xl.3 containing the human heavy chain J-/x intronic 
enhancer. Positive clones are tested for hybridization with a 
mixture of the following V H specific oligonucleotides: 

oligo-7 5' -tea gtg aag gtt tec tgc aag gca tct gga tac acc 
40 ttc acc-3 1 

oligo-8 5 f -tcc ctg aga etc tec tgt gca gec tct gga ttc acc 
ttc agt-3 1 
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Clones that hybridized with both V and J-/z probes 
are isolated and the DNA sequence of the rearranged VDJ 
segment determined . 

B. Construction of rearranged human heavy chain transgenes 
Fragments containing functional VJ segments (open 
reading frame and splice signals) are subcloned into the 
plasmid vector pSP72 such that the plasmid derived Xhol site 
is adjacent to the 5' end of the insert sequence. A subclone 
containing a functional VDJ segment is digested with Xhol and 
Pad (Pad, a rare-cutting enzyme, recognizes a site near the 
J-m intronic enhancer) , and the insert cloned into XhoI/PacI 
digested pHC2 to generate a transgene construct with a 
functional VDJ segment, the J-fi intronic enhancer, the \i 
switch element, the fi constant region coding exons, and the 7I 
constant region, including the sterile transcript associated 
sequences, the 7I switch, and the coding exons. This 
transgene construct is excised with NotI and microinjected 
into the pronuclei of mouse embryos to generate transgenic 
animals as described above. 

EXAMPLE 14 

Light Chain Transgenes 

A. Construction of Plasmid vectors 

1. Plasmid vector pGPlc 

Plasmid vector pGPla is digested with NotI and the 
following oligonucleotides ligated in: 

oligo-81 5'-ggc cgc ate ccg ggt etc gag gtc gac aag ctt teg 
agg ate cgc-3 1 

oligo-82 5'-ggc cgc gga tec teg aaa get tgt cga cct cga gac 
ccg gga tgc-3 1 

The resulting plasmid, pGPlc, contains a poly linker with Xmal, 
Xhol, Sail, Hindlll, and BamHI restriction sites flanked by 
NotI sites. 
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2 . Plasmid vector pGPld 

Plasmid vector pGPla is digested with NotI and the 
following oligonucleotides ligated in: 

oligo-87 5'-ggc cgc tgt cga caa get tat cga tgg ate etc gag 
tgc -3 1 

oligo-88 5 f »ggc cgc act cga gga tec ate gat aag ctt gtc gac 
age -3 ' 

The resulting plasmid, pGPld, contains a polylinker with Sail, 
Hindlll, Clal, BamHI , and Xhol restriction sites flanked by 
NotI sites. 

B. Isolation of Jk and Ck clones 

A human placental genomic DNA library cloned into 
the phage vector XEMBL3 /SP6/T7 (Clonetech Laboratories, Inc., 
Palo Alto, CA) was screened with the human kappa light chain J 
region specific oligonucleotide: 

oligo-36 5 1 - cac ctt egg cca agg gac acg act gga gat taa acg 
taa gca -3 1 

and the phage clones 136.2 and 13 6.5 isolated. A 7 . 4 kb Xhol 
fragment that includes the J/cl segment was isolated from 
13 6.2 and subcloned into the plasmid pNN03 to generate the 
plasmid clone p3 6.2. A neighboring 13 kb Xhol fragment that 
includes Jk segments 2 through 5 together with the Ck gene 
segment was isolated from phage clone 136.5 and subcloned into 
the plasmid pNN03 to generate the plasmid clone p3 6.5. 
Together these two clones span the region beginning 7.2 kb 
upstream of J/cl and ending 9 kb downstream of Ck. 

C. Construction of rearranged light chain transaenes 

1. pCKl, a Ck vector for expressing rearranged variable 
segments 

The 13 kb Xhol insert of plasmid clone p36.5 
containing the Ck gene, together with 9 kb of downstream 
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sequences, is cloned into the Sail site of plasmid vector 
pGPlc with the 5" end of the insert adjacent to the plasmid 
Xhol site. The resulting clone, pCKl can accept cloned 
fragments containing rearranged VJk segments into the unique 
5 1 Xhol site. The transgene can then be excised with NotI and 
purified from vector sequences by gel electrophoresis. The 
resulting transgene construct will contain the human J-C/c 
intronic enhancer and may contain the human 3' k enhancer. 

2 . pCK2 , a Ck vector with heavy chain enhancers for 
expressing rearranged variable segments 

A 0.9 kb Xbal fragment of mouse genomic DNA 
containing the mouse heavy chain J-jU intronic enhancer (J . 
Banerji et al., Cell 3J3:729-740 (1983)) was subcloned into 
pUC18 to generate the plasmid pJH22.1. This plasmid was 
linearized with SphI and the ends filled in with Klenow 
enzyme. The Klenow treated DNA was then digested with Hindlll 
and a 1.4 kb Mlul/Hindlll fragment of phage clone Xl.3 
(previous example) , containing the human heavy chain J-m 
intronic enhancer (Hayday et al., Nature 307 :334-340 (1984)), 
to it. The resulting plasmid, pMHEl, consists of the mouse 
and human heavy chain J-jll intronic enhancers ligated together 
into pUC18 such that they are excised on a single 
BamHI/Hindlll fragment. This 2.3 kb fragment is isolated and 
cloned into pGPlc to generate pMHE2 . pMHE2 is digested with 
Sail and the 13 kb Xhol insert of p3 6.5 cloned in. The 
resulting plasmid, pCK2 , is identical to pCKl, except that the 
mouse and human heavy chain J-ju intronic enhancers are fused 
to the 3 f end of the transgene insert. To modulate expression 
of the final transgene, analogous constructs can be generated 
with different enhancers, i.e. the mouse or rat 3 1 kappa or 
heavy chain enhancer (Meyer and Neuberger, EMBO J . , 
8.: 1959-1964 (1989); Petterson et al. , Nature, 34_4: 165-168 
(1990) ) . 

3. Isolation of rearranged kappa light chain variable 
segments 
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Two human leukocyte genomic DNA libraries cloned 
into the phage vector XEMBL3 /SP6/T7 (Clonetech Laboratories, 
Inc., Palo Alto, CA) were screened with the human kappa light 
chain J region containing 3.5 kb Xhol/Smal fragment of p36.5. 
5 Positive clones were tested for hybridization with the 
following V/c specific oligonucleotide: 

oligo-65 5'-agg ttc agt ggc agt ggg tct ggg aca gac ttc act 
etc acc ate agc-3 1 

10 

Clones that hybridized with both V and J probes are isolated 
and the DNA sequence of the rearranged VJ/c segment determined. 

4 . Generation of transgenic mice containing rearranged human 

15 light chain constructs. 

Fragments containing functional VJ segments (open 
reading frame and splice signals) are subcloned into the 
unique Xhol sites of vectors pCKl and pCK2 to generate 
rearranged kappa light chain transgenes. The transgene 

20 constructs are isolated from vector sequences by digestion 

with Notl. Agarose gel purified insert is microinjected into 
mouse embryo pronuclei to generate transgenic animals. 
Animals expressing human kappa chain are bred with heavy chain 
minilocus containing transgenic animals to generate mice 

25 expressing fully human antibodies. 

Because not all VJk combinations may be capable of 
forming stable heavy-light chain complexes with a broad 
spectrum of different heavy chain VDJ combinations, several 
different light chain transgene constructs are generated, each 

3 0 using a different rearranged VJk clone, and transgenic mice 
that result from these constructs are bred with heavy chain 
minilocus transgene expressing mice. Peripheral blood, 
spleen, and lymph node lymphocytes are isolated from double 
transgenic (both heavy and light chain constructs) animals, 

35 stained with fluorescent antibodies specific for human and 

mouse heavy and light chain immunoglobulins (Pharmingen, San 
Diego, CA) and analyzed by flow cytometry using a FACScan 
analyzer (Becton Dickinson, San Jose, CA) . Rearranged light 
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chain transgenes constructs that result in the highest level 
of human heavy/ light chain complexes on the surface of the 
highest number of B cells, and do not adversely affect the 
immune cell compartment (as assayed by flow cytometric 
analysis with B and T cell subset specific antibodies) , are 
selected for the generation of human monoclonal antibodies, 

D. Construction of unrearranaed light chain minilocus 
transgenes 

1. pJCKl, a Jk, Ck containing vector for constructing 
minilocus transgenes 

The 13 kb Ck containing Xhol insert of p3 6.5 is 
treated with Klenow enzyme and cloned into Hindlll digested, 
Klenow-treated, plasmid pGPld. A plasmid clone is selected 
such that the 5* end of the insert is adjacent to the vector 
derived Clal site. The resulting plasmid, p3 6.5-ld, is 
digested with Clal and Klenow-treated. The JkI containing 7.4 
kb Xhol insert of p3 6.2 is then Klenow-treated and cloned into 
the Clal, Klenow-treated p36.5-ld. A clone is selected in 
which the p3 6.2 insert is in the same orientation as the p36.5 
insert. This clone, pJCKl (Fig. 34), contains the entire 
human Jk region and Ck, together with 7.2 kb of upstream 
sequences and 9 kb of downstream sequences. The insert also 
contains the human J-C/c intronic enhancer and may contain a 
human 3' k enhancer. The insert is flanked by a unique 3' 
Sail site for the purpose of cloning additional 3 1 flanking 
sequences such as heavy chain or light chain enhancers. A 
unique Xhol site is located at the 5' end of the insert for 
the purpose of cloning in unrearranged V/c gene segments. The 
unique Sail and Xhol sites are in turn flanked by NotI sites 
that are used to isolate the completed transgene construct 
away from vector sequences. 

2. Isolation of unrearranged Vk gene segments and generation 
of transgenic animals expressing human Ig light chain protein 

The Vk specific oligonucleotide, oligo-65 (discussed 
above) , is used to probe a human placental genomic DNA library 
cloned into the phage vector 1EMBL3 / SP6/T7 (Clonetech 
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Laboratories, Inc., Palo Alto, CA) . Variable gene segments 
from the resulting clones are sequenced, and clones that 
appear functional are selected. Criteria for judging 
functionality include: open reading frames, intact splice 
acceptor and donor sequences, and intact recombination 
sequence, DNA fragments containing selected variable gene 
segments are cloned into the unique Xhol site of plasmid pJCKl 
to generate minilocus constructs. The resulting clones are 
digested with NotI and the inserts isolated and injected into 
mouse embryo pronuclei to generate transgenic animals. The 
transgenes of these animals will undergo V to J joining in 
developing B-cells. Animals expressing human kappa chain are 
bred with heavy chain minilocus containing transgenic animals 
to generate mice expressing fully human antibodies. 

EXAMPLE 15 
Genomic Heavy Chain Human la Transaene 

This Example describes the cloning of a human 
genomic heavy chain immunoglobulin transgene which is then 
introduced into the murine germline via microinjection into 
zygotes or integration in ES cells. 

Nuclei are isolated from fresh human placental 
tissue as described by Marzluff, W.F., et al. (1985), 
Transcription and Translation: A Practical Approach , B.D. 
Hammes and S.J. Higgins, eds. , pp. 89-129, IRL Press, Oxford). 

The isolated nuclei (or PBS washed human spermatocytes) are 
embedded in 0.5% low melting point agarose blocks and lysed 
with 1 mg/ml proteinase K in 500mM EDTA, 1% SDS for nuclei, or 
with lmg/ml proteinase K in 500mM EDTA, 1% SDS, lOmM DTT for 
spermatocytes at 50 °C for 18 hours. The proteinase K is 
inactivated by incubating the blocks in 40^g/ml PMSF in TE for 
30 minutes at 50°C, and then washing extensively with TE. The 
DNA is then digested in the agarose with the restriction 
enzyme NotI as described by M. Finney in Current Protocols in 
Molecular Biology (F. Ausubel et al., eds. John Wiley & Sons, 
Supp. 4, 1988, e.g., Section 2.5.1). 

The NotI digested DNA is then fractionated by pulsed 
field gel electrophoresis as described by Anand et al., Nuc. 
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Acids Res , 17.: 3425-3433 (1989). Fractions enriched for the 
NotI fragment are assayed by Southern hybridization to detect 
one or more of the sequences encoded by this fragment. Such 
sequences include the heavy chain D segments, J segments, and 
7I constant regions together with representatives of all 6 V H 
families (although this fragment is identified as 670 kb 
fragment from HeLa cells by Berman et al. (1988), supra., we 
have found it to be an 830 kb fragment from human placental 
and sperm DNA) . Those fractions containing this NotI 
fragment are ligated into the NotI cloning site of the vector 
pYACNN as described (Mccormick et al. , Technique 2:65-71 
(1990)). Plasmid pYACNN is prepared by digestion of pYACneo 
(Clontech) with EcoRI and ligation in the presence of the 
oligonucleotide 5 ' - AAT TGC GGC CGC - 3 ' . 

YAC clones containing the heavy chain NotI fragment 
are isolated as described by Traver et al., Proc. Natl. Acad. 
Sci. USA , 8(5:5898-5902 (1989). The cloned NotI insert is 
isolated from high molecular weight yeast DNA by pulse field 
gel electrophoresis as described by M. Finney, op. cit. The 
DNA is condensed by the addition of 1 mM spermine and 
microinjected directly into the nucleus of single cell embryos 
previously described. Alternatively, the DNA is isolated by 
pulsed field gel electrophoresis and introduced into ES cells 
by lipofection (Gnirke et al. , EMBO J . 10:1629-1634 (1991)), 
or the YAC is introduced into ES cells by spheroplast fusion. 

EXAMPLE 16 

Discontinuous Genomic Heavy Chain la Transcrene 

An 85 kb Spel fragment of human genomic DNA, 
containing V H 6, D segments, J segments, the constant region 
and part of the 7 constant region, has been isolated by YAC 
cloning essentially as described in Example 1. A YAC carrying 
a fragment from the germline variable region, such as a 57 0 kb 
NotI fragment upstream of the 670-83 0 kb NotI fragment 
described above containing multiple copies of V x through V 5 ^ is 
isolated as described. (Berman et al. (1988), supra, detected 
two 57 0 kb NotI fragments, each containing multiple V 
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segments.) The two fragments are coinjecteci into the nucleus 
of a mouse single cell embryo as described in Example 1. 

Typically, coinjection of two different DNA 
fragments result in the integration of both fragments at the 
5 same insertion site within the chromosome. Therefore, 

approximately 5 0% of the resulting transgenic animals that 
contain at least one copy of each of the two fragments will 
have the V segment fragment inserted upstream of the constant 
region containing fragment. Of these animals, about 50% will 

10 carry out V to DJ joining by DNA inversion and about 5 0% by 
deletion, depending on the orientation of the 570 kb NotI 
fragment relative to the position of the 85 kb Spel fragment. 
DNA is isolated from resultant transgenic animals and those 
animals found to be containing both transgenes by Southern 

15 blot hybridization (specifically, those animals containing 
both multiple human V segments and human constant region 
g enes ) are tested for their ability to express human 
immunoglobulin molecules in accordance with standard 
techniques. 

20 

EXAMPLE 17 

Identification of functionally rearranged variable region 
sequences in transgenic B cells 

An antigen of interest is used to immunize (see 

25 Harlow and Lane, Antibodies: A Laboratory Manual , Cold Spring 
Harbor, New York (1988)) a mouse with the following genetic 
traits: homozygosity at the endogenous having chain locus for 
a deletion of J H (Examples 10) ; hemizygous for a single copy 
of unrearranged human heavy chain minilocus transgene 

3 0 (examples 5 and 14) ; and hemizygous for a single copy of a 
rearranged human kappa light chain transgene (Examples 6 and 
14). 

Following the schedule of immunization, the spleen 
is removed, and spleen cells used to generate hybridomas. 
35 Cells from an individual hybridoma clone that secretes 

antibodies reactive with the antigen of interest are used to 
prepare genomic DNA. A sample of the genomic DNA is digested 
with several different restriction enzymes that recognize 
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unique six base pair sequences, and fractionated on an agarose 
gel. Southern blot hybridization is used to identify two DNA 
fragments in the 2-10 kb range, one of which contains the 
single copy of the rearranged human heavy chain VDJ sequences 
and one of which contains the single copy of the rearranged 
human light chain VJ sequence. These two fragments are size 
fractionated on agarose gel and cloned directly into pUC18. 
The cloned inserts are then subcloned respectively into heavy 
and light chain expression cassettes that contain constant 
region sequences . 

The plasmid clone pYel (Example 12) is used as a 
heavy chain expression cassette and rearranged VDJ sequences 
are cloned into the Xhol site. The plasmid clone pCKl is used 
as a light chain expression cassette and rearranged VJ 
sequences are cloned into the Xhol site. The resulting clones 
are used together to transfect SP 0 cells to produce antibodies 
that react with the antigen of interest (Co. et al., Proc. 
Natl. Acad. Sci. USA 88:2869 (1991), which is incorporated 
herein by reference) . 

Alternatively, mRNA is isolated from the cloned 
hybridoma cells described above, and used to synthesize cDNA. 
The expressed human heavy and light chain VDJ and VJ sequence 
are then amplified by PCR and cloned (Larrick et al., Biol. 
Technology , 7:934-938 (1989)). After the nucleotide sequence 
of these clones has been determined, oligonucleotides are 
synthesized that encode the same polypeptides, and synthetic 
expression vectors generated as described by Queen et al., 
Proc. Natl. Acad. Sci. USA. , 84:5454-5458 (1989). 

Immunization of Transgenic Animals with Complex Antigens 
The following experiment demonstrates that 

transgenic animals can be successfully immunized with complex 

antigens such as those on human red blood cells and respond 

with kinetics that are similar to the response kinetics 

observed in normal mice. 

Blood cells generally are suitable immunogens and 

comprise many different types of antigens on the surface of 

red and white blood cells. 
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Immunization with human blood 

Tubes of human blood from a single donor were 
collected and used to immunize transgenic mice having 
functionally disrupted endogenous heavy chain loci (J H D) and 
5 harboring a human heavy chain minigene construct (HC1) ; these 
mice are designated as line 112. Blood was washed and 
resuspended in 50 mis Hanks' and diluted to 1x10 s cells/ml 0.2 
mis (2xl0 7 cells) were then injected interperitoneally using a 
28 gauge needle and 1 cc syringe. This immunization protocol 

10 was repeated approximately weekly for 6 weeks. Serum titers 
were monitored by taking blood from retro-orbital bleeds and 
collecting serum and later testing for specific antibody. A 
pre-immune bleed was also taken as a control. On the very 
last immunization, three days before these animals were 

15 sacrificed for serum and for hybridomas, a single immunization 
of 1 x 10 7 cells was given intravenously through the tail to 
enhance the production of hybridomas. 

Table 9 

20 Animals 





Mouse ID 


Line 


Sex 


HC1-112 


JHD 


1 


2343 


112 


M 


+ 


++ 


2 


2344 


112 


M 




+ 


3 


2345 


112 


F 




+ 


4 


2346 


112 


F 




++■ 


5 


2347 


112 • 


F 




++ 


6 


2348 


112 


F 


+ 


++ 


7 


2349 


112 


F 




+ 
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Mice # 2 34 3 and 23 48 have a desired phenotype: human heavy 
chain mini-gene transgenic on heavy chain knock-out 
background . 

35 

Generation of Hvbridomas 

Hybridomas were generated by fusing mouse spleen 
cells of approximately 16 week-old transgenic mice (Table 9) 



161 

that had been immunized as described (supra) to a fusion 
partner consisting of the non-secreting HAT-sensitive myeloma 
cell line, X63 Ag8.653. Hybridoma clones were cultivated and 
hybridoma supernatants containing immunoglobulins having 
5 specific binding affinity for blood cell antigens were 
identified, for example, by flow cytometry. 

Flow cytometry 

Serum and hybridoma supernatants were tested using 

10 flow cytometry. Red blood cells from the donor were washed 4X 
in Hanks 1 balanced salt solution and 50,000 cells were placed 
in 1.1 ml polypropylene microtubes. Cells were incubated with 
antisera or supernatant from the hybridomas for 3 0 minutes on 
ice in staining media (Ix RPMI 164 0 media without phenol red 

15 or biotin (Irvine Scientific) 3% newborn calf serum, 0.1% Na 
azide) . Controls consisted of littermate mice with other 
genotypes. Cells were then washed by centrifugation at 4°C in 
Sorvall RT600B for 5-10 minutes at 1000 rpm. Cells were 
washed two times and then antibody detected on the cell 

20 surface with a fluorescent developing reagent. Two monoclonal 
reagents were used to test. One was a FITC-labeled mouse 
anti-human m heavy chain antibody (Pharmagen, San Diego, CA) 
and the other was a PE-labeled rat anti-mouse kappa light 
chain (Becton-Dickenson, San Jose, CA) . Both of these 

25 reagents gave similar results. Whole blood (red blood cells 
and white blood cells) and white blood cells alone were used 
as target cells. Both sets gave positive results. 

Serum of transgenic mice and littermate controls was 
incubated with either red blood cells from the donor, or white 

3 0 blood cells from another individual, washed and then developed 
with anti-human IgM FITC labeled antibody and analyzed in a 
flow cytometer. Results showed that serum from mice that are 
transgenic for the human mini-gene locus (mice 2343 and 2348) 
show human IgM reactivity whereas all littermate animals 

35 (2344, 2345, 2346, 2347) do not. Normal mouse serum (NS) and 
phosphate buffer saline (PBS) were used as negative controls. 
Red blood cells were ungated and white blood cells were gated 
to include only lymphocytes. Lines are drawn on the x and y 
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axis to provide a reference. Flow cytometry was performed on 
100 supernatants from fusion 2348. Four supernatants showed 
positive reactivity for blood cell antigens. 

EXAMPLE 18 

Reduction of Endogenous Mouse Immunoglobulin Expression 
bv Antisense RNA 

A. Vector for Expression of Antisense Ig Sequences 
1. Construction of the cloning vector pGPlh 
The vector pGPlb (referred to in a previous example) 
is digested with Xhol and BamHI and ligated with the following 
oligonucleotides : 

5 i _ g a t cct cga gac cag gta cca gat ctt gtg aat teg -3 1 
5 1 - teg acg aat tea caa gat ctg gta cct ggt etc gag -3 1 

to generate the plasmid pGPlh. This plasmid contains a 
polylinker that includes the following restriction sites: 
NotI, EcoRI, Bglll, Asp718, Xhol, BamHI, Hindlll, NotI . 

Construction of pBCEl. 

A 0.8 kb Xbal/Bglll fragment of pVH251 (referred to 
in a previous example) , that includes the promoter leader 
sequence exon, first intron, and part of the second exon of 
the human VH-V family immunoglobulin variable gene segment, 
was inserted into Xbal/Bglll digested vector pNN03 to generate 
the plasmid pVH251. 

The 2.2 kb BamHI / EcoRI DNA fragment that includes 
the coding exons of the human growth hormone gene (hGH; 
Seeburg, (1982) DNA 1:239-249) is cloned into Bglll/EcoRI 
digested pGHlh. The resulting plasmid is digested with BamHI 
and the BamHI/Bglll of pVH251N is inserted in the same 
orientation as the hGH gene to generate the plasmid pVhgh. 

A 0.9 kb Xbal fragment of mouse genomic DNA 
containing the mouse heavy chain J-ju intronic enhancer 
(Banerji et al., (1983) Cell 33:729-740) was subcloned into 
pUC18 to generate the plasmid pJH2 2.1. This plasmid was 
linearized with SphI and the ends filled in with klenow 



163 

enzyme. The klenow treated DNA was then digested with Hindlll 
and a 1.4 kb Mlul (klenow) /Hindlll fragment of phage clone Al.3 
(previous example) , containing the human heavy chain J-/x 
intronic enhancer (Hayday et al. , (1984) Nature 307:334-340), 
5 to it. The resulting plasmid, pMHEl, consists of the mouse 
and human heavy chain J-/x intron enhancers ligated together 
into pUC18 such that they can be excised on a single 
BamHI /Hindlll fragment . 

The BamHI/Hindlll fragment of pMHEl is cloned into 

10 BamHI /Hindlll cut pVhgh to generate the B-cell expression 
vector pBCEl. This vector, depicted in Fig. 36, contains 
unique Xhol and Asp718 cloning sites into which antisense DNA 
fragments can be cloned. The expression of these antisense 
sequences is driven by the upstream heavy chain promoter- 

15 enhancer combination the downstream hGH gene sequences provide 
polyadenylation sequences in addition to intron sequences that 
promote the expression of transgene constructs. Antisense 
transgene constructs generated from pBCEl can be separated 
from vector sequences by digestion with NotI . 

20 

B. An IgM antisense transgene construct. 

The following two oligonucleotides: 

5 1 - cgc ggt acc gag agt cag tec ttc cca aat gtc -3' 
25 5'- cgc etc gag aca get gga atg ggc aca tgc aga -3' 

are used as primers for the amplification of mouse IgM 
constant region sequences by polymerase chain reaction (PCR) 
using mouse spleen cDNA as a substrate. The resulting 0.3 kb 

3 0 PCR product is digested with Asp718 and Xhol and cloned into 

Asp718/XhoI digested pBCEl to generate the antisense transgene 
construct pMASl. The purified NotI insert of pMASl is 
microinjected into the pronuclei of half day mouse embryos — 
alone or in combination with one or more other transgene 

35 constructs — to generate transgenic mice. This construct 

expresses an RNA transcript in B-cells that hybridizes with 
mouse IgM mRNA, thus down-regulating the expression of mouse 
IgM protein. Double transgenic mice containing pMASl and a 



164 

human heavy chain transgene minilocus such as pHCl (generated 
either by coinjection of both constructs or by breeding of 
singly transgenic mice) will express the human transgene 
encoded Ig receptor on a higher percentage of B-cell than mice 
5 transgenic for the human heavy chain minilocus alone. The 

ratio of human to mouse Ig receptor expressing cells is due in 
part to competition between the two populations for factors 
and cells that promoter B-cell differentiation and expansion. 
Because the Ig receptor plays a key role in B-cell 
10 development, mouse Ig receptor expressing B-cells that express 
reduced levels of IgM on their surface (due to mouse Ig 
specific antisense down-regulation) during B-cell development 
will not compete as well as cells that express the human 
receptor. 

15 

C. An IgKappa antisense transgene construct. 

The following two oligonucleotides: 

5 1 - cgc ggt acc get gat get gca cca act gta tec -3 f 
2 0 5 1 - cgc etc gag eta aca etc att cct gtt gaa get -3' 

are used as primers for the amplification of mouse IgKappa 
constant region sequences by polymerase chain reaction (PCR) 
using mouse spleen cDNA as a substrate. The resulting 0.3 kb 

2 5 PCR product is digested with Asp718 and Xhol and cloned into 

Asp718/XhoI digested pBCEl to generate the antisense transgene 
construct pKASl. The purified NotI insert of pKASl is 
microinjected into the pronuclei of half day mouse embryos — 
alone or in combination with one or more other transgene 

3 0 constructs — to generate transgenic mice. This construct 

expresses an RNA transcript in B-cells that hybridizes with 
mouse IgK mRNA, thus down-regulating the expression of mouse 
IgK protein as described above for pMASl. 

35 EXAMPLE 19 

This example demonstrates the successful 
immunization and immune response in a transgenic mouse of the 
present invention . 
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Immunization of Mice 

Keyhole limpet hemocyanin conjugated with greater 
than 400 dinitrophenyl groups per molecule (Calbiochem, La 
Jolla, California) (KLH-DNP) was alum precipitated according 
5 to a previously published method (Practical Immunology, L. 
Hudson and F.C. Hay, Blackwell Scientific (Pubs.)/ P- 9/ 
1980) . Four hundred m<? of alum precipitated KLH-DNP along 
with 100 jug dimethyldioctadecyl Ammonium Bromide in 10 0 juL of 
phosphate buffered saline (PBS) was injected intraperitoneally 
10 into each mouse. Serum samples were collected six days later 
by retro-orbital sinus bleeding. 

Analysis of Human Antibody Reactivity in Serum 

Antibody reactivity and specificity were assessed 

15 using an indirect enzyme-linked immunosorbent assay (ELISA) . 
Several target antigens were tested to analyze antibody 
induction by the immunogen. Keyhole limpet hemocyanin 
(Calbiochem) was used to identify reactivity against the 
protein component, bovine serum albumin-DNP for reactivity 

2 0 against the hapten and/or modified amino groups, and KLH-DNP 
for reactivity against the total immunogen. Human antibody 
binding to antigen was detected by enzyme conjugates specific 
for IgM and IgG sub-classes with no cross reactivity to mouse 
immunoglobulin. Briefly, PVC microtiter plates were coated 

2 5 with antigen drying overnight at 3 7 °C of 5 /xg/mL protein in 

PBS. Serum samples diluted in PBS, 5% chicken serum, 0.5% 
Tween-20 were incubated in the wells for 1 hour at room 
temperature, followed by anti-human IgG Fc and IgG F(ab')- 
horseradish peroxidase or anti-human IgM Fc-horseradish 

3 0 peroxidase in the same diluent. After 1 hour at room 

temperature enzyme activity was assessed by addition of ABTS 
substrate (Sigma, St. Louis, Missouri) and read after 3 0 
minutes at 415-490 nm. 

35 Human Heavy Chain Participation in Immune Response in 
Transgenic Mice 

Figures 37A-37D illustrate the response of three 

mouse littermates to immunization with KLH-DNP. Mouse number 
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12 9 6 carried the human IgM and IgG unrearranged transgene and 
was homozygous for mouse Ig heavy chain knockout. Mouse 
number 1299 carried the transgene on a non-knockout 
background, while mouse 1301 inherited neither of these sets 
5 of genes. Mouse 1297, another littermate, carried the human 
transgene and was hemizygous with respect to mouse heavy chain 
knockout. It was included as a non- immunized control. 

The results demonstrate that both human IgG and IgM 
responses were developed to the hapten in the context of 
10 conjugation to protein. Human IgM also developed to the KLH 
molecule, but no significant levels of human IgG were present 
at this time point. In pre- immunization serum samples from 
the same mice, titers of human antibodies to the same target 
antigens were insignificant. 

15 

EXAMPLE 2 0 

This example demonstrates the successful 
immunization with a human antigen and immune response in a 
transgenic mouse of the present invention, and provides data 
2 0 demonstrating that nonrandom somatic mutation occurs in the 
variable region sequences of the human transgene. 

Demonstration of antibody responses comprising human 
immunoglobulin heavy chains against a human glycoprotein 
25 antigen 

Transgenic mice used for the experiment were 
homozygous for functionally disrupted murine immunoglobulin 
heavy chain loci produced by introduction of a transgene at 
the joining (J) region (supra) resulting in the absence of 

30 functional endogenous (murine) heavy chain production. The 
transgenic mice also harbored at least one complete 
unrearranged human heavy chain mini-locus transgene, (HC1, 
supra ) , which included a single functional V H gene (V H 2 51) , 
human jLt constant region gene, and human yl constant region 

35 gene. Transgenic mice shown to express human immunoglobulin 
transgene products ( supra ) were selected for immunization with 
a human antigen to demonstrate the capacity of the transgenic 
mice to make an immune response against a human antigen 
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immunization. Three mice of the HC1-2 6 line and three mice of 
the HC1-57 line (supra) were injected with human antigen. 

One hundred /xg of purified human carcinoembryonic 
antigen (CEA) insolubilized on alum was injected in complete 
5 Freund's adjuvant on Day 0, followed by further weekly 

injections of alum-precipitated CEA in incomplete Freund's 
adjuvant on Days 7, 14, 21, and 28. Serum samples were 
collected by retro-orbital bleeding on each day prior to 
injection of CEA* Equal volumes of serum were pooled from 

10 each of the three mice in each group for analysis. 

Titres of human /x chain-containing immunoglobulin 
and human y chain-containing immunoglobulin which bound to 
human CEA immobilized on microtitre wells were determined by 
ELISA assay. Results of the ELISA assays for human /x chain- 

15 containing immunoglobulins and human 7 chain-containing 

immmunoglbulins are shown in Figs. 38 and 39, respectively. 
Significant human \i chain Ig titres were detected for both 
lines by Day 7 and were observed to rise until about Day 21. 
For human 7 chain Ig, significant titres were delayed, being 

20 evident first for line HC1-57 at Day 14, and later for line 
HC1-2 6 at Day 21. Titres for human 7 chain Ig continued to 
show an increase over time during the course of the 
experiment. The observed human (jl chain Ig response, followed 
by a plateau, combined with a later geveloping 7 chain 

25 response which continues to rise is characteristic of the 
pattern seen with affinity maturation. Analysis of Day 21 
samples showed lack of reactivity to an unrelated antigen, 
keyhole limpet hemocyanin (KLC) , indicating that the antibody 
response was directed against CEA in a specific manner. 

3 0 These data indicate that animals transgenic for 

human unrearranged immunoglobulin gene loci: (1) can respond 
to a human antigen (e.g., the human glycoprotein, CEA), (2) 
can undergo isotype switching ( "class switching) as 
exemplified by the observed /x to 7 class switch, and (3) 

3 5 exhibit characteristics of affinity maturation in their 

humoral immune responses. In general, these data indicate: 
(1) the human Ig transgenic mice have the ability to induce 
heterologous antibody production in response to a defined 
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antigen, (2) the capacity of a single transgene heavy chain 
variable region to respond to a defined antigen, (3) response 
kinetics over a time period typical of primary and secondary 
response development, (4) class switching of a transgene- 
encoded humoral immune response from IgM to IgG, and (5) the 
capacity of transgenic animal to produce human-sequence 
antibodies against a human antigen. 

Demonstration of somatic mutation in a human heavy chain 
transgene minilocus . 

Line HC1-57 transgenic mice, containing multiple 
copies of the HC1 transgene, were bred with immunoglobulin 
heavy chain deletion mice to obtain mice that contain the HC1 
transgene and contain disruptions at both alleles of the 
endogenous mouse heavy chain ( supra ) . These mice express 
human mu and gammal heavy chains together with mouse kappa and 
lambda light chains f supra ) . One of these mice was 
hyper immunized against human carcinoembryonic antigen by 
repeated intraperitoneal injections over the course of 1.5 
months. This -mouse was sacrificed and lymphoid cells isolated 
from the spleen, inguinal and mesenteric lymph nodes, and 
peyers patches. The cells were combined and total RNA 
isolated. First strand cDNA was synthesized from the RNA and 
used as a template for PCR amplification with the following 2 
oligonucleotide primers: 

149 5* -eta get cga gtc caa gga gtc tgt gec gag gtg cag ctg 
(g/a/t/c)-3« 

151 5'-ggc get cga gtt cca cga cac cgt cac egg ttc-3 1 

These primers specifically amplify VH251/gammal cDNA 
sequences. The amplified sequences were digested with Xhol 
and cloned into the vector pNN03. DNA sequence from the 
inserts of 2 3 random clones is shown in Fig. 40; sequence 
variations from germline sequence are indicated, dots indicate 
sequence is identical to germline. Comparison of the cDNA 
sequences with the germline sequence of the VH2 51 transgene 



reveals that 3 of the clones are completely unmutated, while 
the other 2 0 clones contain somatic mutations. One of the 3 
non-mutated sequences is derived from an out-of-frame VDJ 
joint. Observed somatic mutations at specific positions of 
occur at similar frequencies and in similar distribution 
patterns to those observed in human lymphocytes (Cai et al. 
(1992) J . Exp . Med . 176 : 1073, incorporated herein by 
reference) . The overall frequency of somatic mutations is 
approximately 1%; however, the frequency goes up to about 5% 
within CDR1, indicating selection for amino acid changes that 
affect antigen binding. This demonstrates antigen driven 
affinity maturation of the human heavy chain sequences. 

EXAMPLE 21 

This example demonstrates the successful formation 
of a transgene by co- introduction of two separate 
polynucleotides which recombine to form a complete human light 
chain minilocus transgene. 

Generation of an unrearranaed light chain minilocus transgene 
by co-iniection of two overlapping DNA fragments 
1. Isolation of unrearranaed functional V 1 C gene segments 
vk65.3, vk65.5. vk65.8 and vk65.15 

The V K specific oligonucleotide, oligo-65 (5'-agg 
ttc agt ggc agt ggg tct ggg aca gac ttc act etc acc ate agc- 
3 1 ), was used to probe a human placental genomic DNA library 
cloned into the phage vector XEMBL3 /SP6/T7 (Clonetech 
Laboratories, Inc., Palo Alto, CA) . DNA fragments containing 
V K segments from positive phage clones were subcloned into 
plasmid vectors. Variable gene segments from the resulting 
clones are sequenced, and clones that appear functional were 
selected. Criteria for judging functionality include: open 
reading frames, intact splice acceptor and donor sequences, 
and intact recombination sequence. DNA sequences of 4 
functional V K gene segments (vk65.3, vk65.5, vk65.8, and 
vk65.15) from 4 different plasmid clones isolated by this 
procedure are shown in Figs. 41-44. The four plasmid clones, 
p65.3f, p65.5gl, p65.8, and p65.15f, are described below. 
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(1 a) p65.3f 

A 3 kb Xba fragment of phage clone \65.3 was 
subcloned into pUC19 so that the vector derived Sail site was 
proximal to the 3 1 end of the insert and the vector derived 
BamHI site 5 1 . The 3 kb BamHI/Sall insert of this clone was 
subcloned into pGPlf to generate p65.3f. 

(1 b) p65.5gl 

A 6.8 kb EcoRI fragment of phage clone \65.5 was 
subcloned into pGPlf so that the vector derived Xhol site is 
proximal to the 5 1 end of the insert and the vector derived 
Sail site 3'. The resulting plasmid is designated p65.5gl. 

(1 c) p65.8 

A 6.5 kb Hindlll fragment of phage clone \65.8 was 
cloned into pSP72 to generate p65.8. 

(1 d) p65.15f 

A 10 kb EcoRI fragment of phage clone X65.16 was 
subcloned into pUC18 to generate the plasmid p65.15.3. The V K 
gene segment within the plasmid insert was mapped to a 4 . 6 kb 
EcoRI/Hindlll subfragment, which was cloned into pGPlf . The 
resulting clone, p65.15f, has unique Xhol and Sail sites 
located at the respective 5' and 3' ends of the insert. 

2. pKV4 

The Xhol /Sail insert of p65.8 was cloned into the 
Xhol site of p65.15f to generate the plasmid pKV2 . The 
Xhol /Sail insert of p65.5gl was cloned into the Xhol site of 
pKV2 to generate pKV3 . The Xhol/Sall insert of pKV3 was 
cloned into the Xhol site of p65.3f to generate the plasmid 
pKV4. This plasmid contains a single 21 kb Xhol/Sall insert 
that includes 4 functional V K gene segments. The entire 
insert can also be excised with Notl. 



3 . PKC1B 
(3 a) pKcor 
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Two Xhol fragments derived from human genomic DNA 
phage X clones were subcloned into plasmid vectors. The 
first, a 13 kb J K 2-J K 5/C K containing fragment, was treated with 
Klenow enzyme and cloned into Hindlll digested, Klenow 
treated, plasmid pGPld. A plasmid clone (pK-31) was selected 
such that the 5 f end of the insert is adjacent to the vector 
derived Clal site. The second Xhol fragment, a 7.4 kb piece 
of DNA containing J K 1 was cloned into XhoI/Sall-digested 
pSP72, such that the 3 f insert Xhol site was destroyed by 
ligation to the vector Sail site. The resulting clone, 
p3 6.2s, includes an insert derived Clal site 4 . 5 kb upstream 
of J K 1 and a polylinker derived Clal site downstream in place 
of the naturally occurring Xhol site between J K 1 and J K 2. This 
clone was digested with Clal to release a 4.7 kb fragment 
which was cloned into Clal digested pK-31 in the correct 5' to 
3 1 orientation to generate a plasmid containing all 5 human J K 
segments, the human intronic enhancer human C K , 4 . 5 kb of 5 1 
flanking sequence, and 9 kb of 3' flanking sequence. This 
plasmid, pKcor, includes unique flanking Xhol and Sail sites 
on the respective 5 1 and 3 1 sides of the insert. 

(3 b) pKcorB 

A 4 kb BamHI fragment containing the human 3 1 kappa 
enhancer (Judde, J.-G. and Max, E.E. (1992) Mol. Cel l. Biol. 
12: 5206, incorporated herein by reference) was cloned into 
pGPlf such that the 5' end is proximal to the vector Xhol 
site. The resulting plasmid, p24Bf , was cut with Xhol and the 
17.7 kb Xhol/Sall fragment of pKcor cloned into it in the same 
orientation as the enhancer fragment. The resulting plasmid, 
pKcorB, includes unique Xhol and Sail sites at the 5' and 3' 
ends of the insert respectively. 

(3 c) pKClB 

The Xhol/ Sail insert of pKcorB was cloned into the 
Sail site of p65.3f to generate the light-chain minilocus- 
transgene plasmid pKClB. This plasmid includes a single 
functional human V K segment, all 5 human J K segments, the human 



172 

intronic enhancer, human C K/ and the human 3 1 kappa enhancer. 
The entire 25 kb insert can be isolated by NotI digestion. 

4. Co4 

The two NotI inserts from plasmids pKV4 and pKClB 
were mixed at a concentration of 2.5 /xg/ml each in 
microinjection buffer, and co-injected into the pronuclei of 
half day mouse embryos as described in previous examples. 
Resulting transgenic animals contain transgene inserts 
(designated Co4, product of the recombination shown in Fig. 
45) in which the two fragments co-integrated. The 3 f 3 kb of 
the pKV4 insert and the 5'3 kb of the pKClB insert are 
identical. Some of the integration events will represent 
homologous recombinations between the two fragments over the 3 
kb of shared sequence. The Co4 locus will direct the 
expression of a repertoire of human sequence light chains in a 
transgenic mouse. 

EXAMPLE 22 

This example demonstrates the successful production 
of a murine hybridoma clone secreting a monoclonal antibody 
reactive with a specific immunogen, wherein the monoclonal 
antibody comprises a human immunoglobulin chain encoded by a 
human Ig transgene. 

Generation of Monoclonal Antibodies Incorporating Human Heavy 
Chain Transgene Product 

1. Immunization of Mouse Harboring Human Heavy Chain 
Transgene 

A mouse containing a human heavy chain encoding 
transgene and homozygous for knockout (i.e., functional 
disruption) of the endogenous heavy chain locus ( see , EXAMPLE 
20, supra) was immunized with purified human CEA, and spleen 
cells were subsequently harvested after a suitable immune 
response period. The murine spleen cells were fused with 
mouse myeloma cells to generate hybridomas using conventional 
techniques ( see , Kohler and Milstein, Eur. J. Immunol., 6.: 511- 
519 (1976); Harlow and Lane, Antibodies: A Laborat ory Manual, 
Cold Spring Harbor, New York (1988)). The mouse used for 
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immunization contained a human unrearranged heavy chain 
minilocus transgene which comprised a single functional V H 
gene (V H251 ) , human D and J segments, human \i constant region, 
and human 71 constant region genes. The transgenic line from 
which it originated was designated HC1-57 (supra) . 

One hundred of purified human carcinoembryonic 
antigen (CEA) (Cyrstal Chem, Chicago, IL or Scripps Labs, San 
Diego, CA) insolubilized on alum was injected in complete 
Freund's adjuvant on Day 0, followed by further weekly 
injections of alum-precipitated CEA in incomplete Freund's 
adjuvant on Days 7, 14, 21, and 28. An additional 20 jug of 
soluble CEA was administered intravenously on Day 83, followed 
by 50 fj,g alum-precipitated CEA in incomplete Freund's adjuvant 
on Day 92. Human heavy chain responses to CEA were confirmed 
in serum samples prior to fusion of spleen cells with myleoma 
cells. The animal was sacrificed on Day 95, the spleen 
removed and fused with P3X63-Ag8 . 653 mouse myeloma cells (ATCC 
CRL 1580, American Type Culture Collection, Rockville, MD) 
using polyethylene glycol. Two weeks later, supernates from 
fusion wells were screened for the presence of antibodies 
specifically reactive with CEA, and which contained human 
heavy chain fi or 7 constant region epitopes by ELISA. 
Briefly, purified human CEA was coated onto PVC microtitre 
plates at 2.5 nq/ml, and incubate with culture supernate 
diluted 1:4 or 1:5 in PBS, 0.5% Tween-2 0, 5% chicken serum. 
Plates were washed, followed by addition of horseradish 
peroxidase-conjugated goat antiserum specific for human IgG Fc 
or rabbit antiserum specific for human IgM FcSMu (Jackson 
ImmunoResearch, West Grove, PA) . Presence of conjugate bound 
to captured antibody was determined, after further washing, by 
the addition of ABTS substrate. Two independent fusion wells 
were found to contain antibody with substantial binding to 
CEA. After cloning, both hybridomas were found to be positive 
for the presence of human fi chain and murine k chain by ELISA. 
No mouse IgG or IgM were detected using similar assays. 

Subcloning of the two independent parent hybridomas 
resulted in two clones, designated 92-09A-4F7-A5-2 and 92-09A- 
1D7-1-7-1. Both lines were deposited with the ATCC Patent 
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Culture Depository under the Budapest Treaty and were assigned 
ATCC Designation HB 11307 and HB 11308, respectively. Culture 
supernatants from these cell lines were assessed for 
specificity by testing for reactivity to several purified 
target proteins using ELISA. As shown in Fig. 46, EL1SA 
assays for determining the reactivity of the monoclonal 
antibodies to various antigens demonstrate that only CEA and 
the CEA-related antigen NCA-2 show significant reactivity, 
indicating the development of a restricted reactivity for the 
variable regions of the heterohybrid immunoglobulin molecules. 

EXAMPLE 2 3 

This example demonstrates that a rearranged human 
VDJ gene encoded by a human Ig mini locus transgene may be 
transcribed as a transcript which includes an endogenous Ig 
constant region gene, for example by the mechanism of trans- 
switching, to encode a chimeric human/mouse Ig chain. 
Identification of Trans-Switch Transcripts Encoding Chimeric 
Human-Mouse Heavy Chains 

RNA was isolated from a hyper immunized HC1 line 57 
transgenic mouse homozygous for the endogenous heavy chain J 
segment deletion (supra) . cDNA was synthesized according to 
Taylor et al. (1993) Nucleic Acids Res. 20: 6287, incorporated 
herein by reference, and amplified by PCR using the following 
two primers: 

o-149 (human V H2 si) • 

S'-CTA GCT CGA GTC CAA GGA GTC TGT GCC GAG GTG CAG CTG (G,A,T,C)-3' 

o-249 (mouse gamma) : 

5'-GGC GCT CGA GCT GGA CAG GG(A/C) TCC A(G/T) A GTT CCA-3 ' 

Oligonucleotide o-149 is specific for the HC1- 
encoded variable gene segment V H251 , while o-249 hybridizes to 
both mouse and human gamma sequences with the following order 
of specificities: 

mouse 7I = mouse 72b = mouse y3 > mouse 72a >> human 71. 
DNA sequences from 10 randomly chosen clones generated from 
the PCR products was determined and is shown in Fig. 47. Two 
clones comprised human VDJ and mouse 7I; four clones comprised 
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human VDJ and mouse 72b; and four clones comprised human VD J . 
and mouse 73 . These results indicate that in a fraction of 
the transgenic B cells, the transgene-encoded human VDJ 
recombined into the endogenous murine heavy chain locus by 
5 class switching or an analogous recombination. 

EXAMPLE 24 

This example describes a method for screening a pool 
of hybridomas to discriminate clones which encode chimeric 

10 human/mouse Ig chains from clones which encode and express a 
human Ig chain. For example, in a pool of hybridoma clones 
made from a transgenic mouse comprising a human Ig heavy chain 
transgene and homozygous for a J region-disrupted endogenous 
heavy chain locus, hybridoma clones encoding trans-switched 

15 human VDJ-murine constant region heavy chains may be 

identified and separated from hybridoma clones expressing 
human VDJ-human constant region heavy chains. 
Sceenina Hybridomas to Eliminate Chimeric Icr Chains 

The screening process involves two stages, which may 

20 be conducted singly or optionally in combination: (1) a 

preliminary ELISA-based screen, and (2) a secondary molecular 
characterization of candidate hybridomas. Preferably, a 
preliminary ELISA-based screen is used for initial 
identification of candidate hybridomas which express a human 

25 VDJ region and a human constant region. 

Hybridomas that show positive reactivity with the 
antigen (e.g., the immunogen used to elicit the antibody 
response in the transgenic mouse) are tested using a panel of 
monoclonal antibodies that specifically react with mouse jjl, 7, 

3 0 k , and X, and human /i, 7, and /c. Only hybridomas that are 

positive for human heavy and light chains, as well as negative 
for mouse chains, are identified as candidate hybridomas that 
express human immunoglobulin chains. Thus, candidate 
hybridomas are shown to have reactivity with specific antigen 

3 5 and to possess epitopes characteristic of a human constant 
region. 

RNA is isolated from candidate hybridomas and used 
to synthesize first strand cDNA. The first strand cDNA is 
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then ligated to a unique single-stranded oligonucleotide of 
predetermined sequence (oligo-X) using RNA ligase (which 
ligates single-stranded DNA) . The ligated cDNA is then 
amplified in two reactions by PCR using two sets of 
5 oligonucleotide primers. Set H (heavy chain) includes an 

oligo that specifically anneals to either human ju or human 7I 
(depending on the results of the ELISA) and an oligo that 
anneals to the oligo-X sequence. This prevents bias against 
detection of particular V segments, including mouse V segments 

10 that may have trans-rearranged into the human minilocus. A 
second set of primers, Set L (light chain) , includes an oligo 
that specifically anneals to human k and an oligo that anneals 
specifically to oligo-X. The PCR products are molecular ly 
cloned and the DNA sequence of several are determined to 

15 ascertain whether the hybridoma is producing a unique human 
antibody on the basis of sequence comparison to human and 
murine Ig sequences . 

EXAMPLE 25 

2 0 This example demonstrates production of a transgenic 

mouse harboring a human light chain (k) minilocus. 
Human k Minilocus transgenic mice 
KC1 

A 13 kb Xhol Jk2-K/c containing fragment from a phage 
25 clone (isolated from a human genomic DNA phage library by 
hybridization to a k specific oligonucleotide, e.g., supra) 
was treated with Klenow enzyme and cloned into the Klenow 
treated Hindlll site of pGPld to produce pK-31. This 
destroyed the insert Xhol sites and positioned the unique 
30 polylinker derived Xhol site at the 5' end next to Jk2. A 
unique polylinker derived Clal site is located between this 
Xhol site and the inset sequences, while a unique polylinker 
derived Sail site is located at the 3 1 end of the insert. A 
7.5 kb Xhol fragment, containing J/cl and upstream sequences, 
35 was also isolated from a human genomic DNA phage clone 
(isolated from a human genomic DNA phage library by 
hybridization to a k specific oligonucleotide, e.g. supra) . 
This 7.5 kb Xhol fragment was cloned into the Sail site of 
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pSP72 (Promega, Madison, Wisconsin) , thus destroying both Xhol 
sites and positioning a polylinker Clal site 3' of J/cl. 
Digestion of the resulting clone with Clal released a 4,7 kb 
fragment containing Jkl and 4.5 kb of upstream sequences. 
5 This 4.7 kb fragment was cloned into the Clal site of pK-31 to 
create pKcor. The remaining unique 5 f Xhol site is derived 
from polylinker sequences. A 6 . 5 kb Xhol/Sall DNA fragment 
containing the unrearranged human V/cIII gene segment 65.8 
(plasmid p65.8, EXAMPLE 21) was cloned into the Xhol site of 

10 pKcor to generate the plasmid pKCl. The NotI insert of pKCl 
was microinjected into 1/2 day mouse embryos to generate 
transgenic mice. Two independent pKCl derived transgenic 
lines were established and used to breed mice containing both 
heavy and light chain miniloci. These lines, KC1-67 3 and KC1- 

15 674, were estimated by Southern blot hybridization to contain 
integrations of approximately 1 and 10-2 0 copies of the 
transgenes respectively . 



KCle 

20 The plasmid pMHEl (EXAMPLES 13 and 18) was digested 

with BamHI and Hindlll to excise the 2.3 kb insert containing 
both the mouse and human heavy chain J-/x intronic enhancers. 
This fragment was Klenow treated, ligated to Sail linkers (New 
England Biolabs, Beverly, Massachusetts) , and cloned into the 

25 unique 3' Sail site of pKCl to generate the plasmid pKCle. 

The NotI insert of pKCle was microinjected into 1/2 day mouse 
embryos to generate transgenic mice. Four independent pKCle 
derived transgenic lines were established and used to breed 
mice containing both heavy and light chain miniloci. These 

30 lines, KCle-1399, KCle-1403, KCle-1527, and KCle-1536, were 
estimated by Southern blot hybridization to contain 
integrations of approximately 20-50, 5-10, 1-5, and 3-5 copies 
of the transgene, respectively. 



35 r>KC2 

A 6.8 kb Xhol /Sail DNA fragment containing the 
unrearranged human VkIII gene segment 65.5 (plasmid p65.5gl, 
EXAMPLE 21) was cloned into the unique 5' Xhol site of pKCl to 
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generate the plasmid pKC2 . This minilocus transgene contains 
two different functional VkIII gene segments. The NotI insert 
of pKC2 was microinjected into 1/2 day mouse embryos to 
generate transgenic mice. Five independent pKC2 derived 
transgenic lines were established and used to breed mice 
containing both heavy and light chain miniloci. These lines, 
KC2-1573, KC2-1579, KC2-1588, KC2-1608, and KC2-1610, were 
estimated by Southern blot hybridization to contain 
integrations of approximately 1-5, 10-50, 1-5, 50-100, and 5- 
2 0 copies of the transgene, respectively. 

EXAMPLE 2 6 

This example shows that transgenic mice bearing the 
human k transgene can make an antigen- induced antibody 
response forming antibodies comprising a functional human k 
chain. 

Antibody Responses Associated with Human Iq k Light Chain 

A transgenic mouse containing the HC1-57 human heavy 
chain and KCle human k transgenes was immunized with purified 
human soluble CD4 (a human glycoprotein antigen) . Twenty fig 
of purified human CD4 (NEN Research products, Westwood, MA) 
insolublized by conjugation to polystyrene latex particles 
(Polysciences , Warrington , PA) was in j ected intraperitoneally 
in saline with dimethyldioctadecyl ammonium bromide 
(Calbiochem, San Diego, CA) on Day 0, followed by further 
injections on Day 2 0 and Day 34. 

Retro-orbital bleeds were taken on Days 2 5 and 40, 
and screened for the presence of antibodies to CD4, containing 
human IgM or human IgG heavy chain by ELISA. Briefly, 
purified human CD4 was coated onto PVC microtitre plates at 
2.5 nq/ml and incubated with culture supernate diluted 1:4/1:5 
in PBS, 0.5% Tween-20, 5% chicken serum. Plates were washed, 
followed by addition of horseradish peroxidase-con jugated goat 
antiserum specific for human IgG Fc or rabbit antiserum 
specific for human IgM Fc5Mu (Jackson ImmunoResearch, Westr 
Grove, PA) . Presence of conjugate bound to captured antibody 
was determined after further washing by addition of ABTS 
substrate. Human ^ reactive with antigen was detected in both 
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bleeds, while there was essentially undetectable 7 reactivity. 
The Day 40 sample was also tested for antigen-reactive human k 
chain using the same assay with goat anti-human k peroxidase 
conjugate (Sigma, St* Louis, MO) . CD4-binding k reactivity 
was detected at this time point. The assay results are shown 
in Fig. 48. 

EXAMPLE 27 

This example shows the successful generation of mice 
which are homozygous for functionally disrupted murine heavy 
and light chain loci (heavy chain and k chain loci) and which 
concomitantly harbor a human heavy chain transgene and a human 
light chain transgene capable of productively rearranging to 
encode functional human heavy chains and functional human 
light chains. Such mice are termed "0011" mice, indicating by 
the two O's in the first two digits that the mice lack 
functional heavy and light chain loci and indicating by the 
l"s in the second two digits that the mice are hemizygous for 
a human heavy chain transgene and a human light chain 
transgene. This example shows that such 0011 mice are capable 
of making a specific antibody response to a predetermined 
antigen, and that such an antibody response can involve 
isotype switching . 

0011/0012 Mice: Endogenous la Knockout + Human la Transaenes 

Mice which were homozygous for a functionally 
disrupted endogenous heavy chain locus lacking a functional J H 
region (designated JHD+4* or JHA++) and also harboring the 
human HC1 transgene, such as the HC1-2 6 transgenic mouse line 
described supra, were interbred with mice homozygous for a 
functionally disrupted endogenous kappa chain locus lacking a 
functional J H region (designated here as JKD++ or JKA++; see 
Example 9) to produce mice homozygous for functionally 
disrupted heavy chain and kappa chain loci (heavy chain/kappa 
chain knockouts) , designated as JHD++/JKD++ and containing a 
HC1 transgene. Such mice were produced by interbreeding and 
selected on the basis of genotype as evaluated by Southern 
blot of genomic DNA. These mice, designated HC1- 
26+/ JKD++/JHD++ mice, were interbred with mice harboring a 
human kappa chain transgene (lines KC2-1610, KCle-1399, and 
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KCle-1527; see Example 25), and Southern blot analysis of 
genomic DNA was used to identify offspring mice homozygous for 
functionally disrupted heavy and light chain loci and also 
hemizygous for the HC1 transgene and the KC2 or KCle 
5 transgene. Such mice are designated by numbers and were 
identified as to their genotype, with the following 
abbreviations: HC1-2 6+ indicates hemizygosity for the HC1-2 6 
line human heavy chain minilocus transgene integration; JHD++ 
indicates homozygosity for J H knockout; JKD++ indicates 

10 homozygosity for J K knockout; KC2-1610+ indicates hemizygosity 
for a KC2 human k transgene integrated as in line KC2-1610; 
KCle-1527+ indicates hemizygosity for a KCle human k transgene 
integrated as in line KCle-1527; KCle-1399+ indicates 
hemizygosity for a KCle human k transgene integrated as in 

15 line KCle-1399. 

The resultant individual offspring were each given a 
numerical designation (e.g., 6295, 6907, etc.) and each was 
evaluated for the presence of J H knockout alleles, J K knockout 
alleles, HC1-2 6 transgene, and k transgene (KC2 or KCle) and 

20 determined to be either hemizygous ( + ) or homozygous (++) at 
each locus. Table 10 shows the number designation, sex, and 
genotypes of several of the offspring mice. 

Table 10 



ID No. 


Sex 


la Code 


Genotype 








6295 


M 


0011 


HC1- 


26+; 


JHD++J 


• JKD++ 


;KC2- 


1610+ 


6907 


M 


0011 


HC1- 


2 6+; 


JHD++; 


• JKD++ 


;KCle 


-1527+ 


7086 


F 


0011 


HC1- 


26+; 


JHD++ , 


• JKD++ 


;KCle 


-1399+ 


7088 


F 


0011 


HC1- 


26+; 


JHD++ , 


* JKD++ 


;KCle 


-1399+ 


7397 


F 


0011 


HC1- 


26+; 


JHD++ i 


• JKD++ 
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We removed spleens from three 6 week old female 
mice. Mouse # 7655 was determined by Southern blot 
hybridization to be hemizygous for the HC1 (line 2 6) and KC2 
(line 1610) transgene integrations, and homozygous for the JHa 
5 and Jka targeted deletions of the mouse jtx and kJ regions. 

Mouse #7656 was determined by Southern blot hybridization to 
be hemizygous for the KC2 (line 1610) transgene integration 
and homozygous for the JHa and Jka targeted deletions of the 
mouse fx and kJ regions. Mouse # 7777 was determined by 
10 Souther blot hybridization to be hemizygous for the JHa and 

Jka targeted deletions of the mouse fj, and kJ regions. Because 
of the recessive nature of these deletions, this mouse should 
be phenotypically wild-type. 

15 Expression of Endogenous la Chains in 0011 Mice 

FACS analysis using a panel of antibodies reactive 
with either human \i, mouse /i, hman k, mouse k, or mouse A was 
used to sort lymphocytes explanted from (1) a wildtype mouse 
(7777), (2) a 0001 mouse homozygous for heavy chain and kappa 

20 knockout alleles and harboring a human light chain transgene 
(7656) , and (3) a 0011 mouse homozygous for heavy chain and 
kappa knockout alleles and harboring a human light chain 
transgene and a human heavy chain transgene (7 655) . 

We prepared single cell suspensions from spleen and 

25 lysed the red cells with NH 4 Cl, as described by Mishell and 

Shiigi (Mishell, B.B. & Shiigi, S.M. (eds) Selected Methods in 
Cellular Immunology . W.H. Freeman & Co. , New York, 1980) . 
The lymphocytes are stained with the following reagents: 
propidium iodide (Molecular Probes, Eugene, OR) , FITC 

30 conjugated anti-human IgM (clone G20-127; Pharmingen, San 
Diego, CA) , FITC conjugated anti-mouse IgM (clone R6-60.2; 
Pharmingen, San Diego, CA) , phycoerythrin conjugated anti- 
human Ig/c (clone HP6062; CalTag, South San Francisco, CA) , 
FITC conjugated anti-mouse IgX (clone R26-46; Pharmingen, San 

35 Diego, CA) FITC conjugated anti-mouse B220 (clone RA3-6B2; 
Pharmingen, San Diego, CA) , and Cy-Chrome conjugated anti- 
mouse B220 (clone RA3-6B2; Pharmingen, San Diego, CA) . We 
analyzed the stained cells using a FACScan flow cytometer and 
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LYSIS II software (Becton Dickinson, San Jose, CA) . 
Macrophages and residual red cells are excluded by gating on 
forward and side scatter. Dead cells are excluded by gating 
out propidium iodide positive cells. The flow cytometric data 
in Figs. 49 and 50 confirms the Southern blot hybridization 
data and demonstrates that mouse #7 655 expresses both human n 
and human k and relatively little if any mouse /x or mouse /c. 
Nevertheless a significant fraction of the B cells (about 70- 
80%) appear to express hybrid Ig receptors consisting of human 
heavy and mouse X light chains. 

Fig. 49 shows the relative distribution of B cells 
expressing human fi or mouse jjl on the cell surface; 0011 mouse 
(7655) lymphocytes are positive for human p but relatively 
lack mouse /z; 0001 mouse (7656) lymphocytes do not express 
much human or mouse fi; wildtype mouse (7777) lymphocytes 
express mouse ii but lack human /x. 

Fig. 50 shows the relative distribution of B cells 
expressing human k or mouse k on the cell surface; 0011 mouse 
(7655) lymphocytes are positive for human k but relatively 
lack mouse k ; 0001 mouse (7656) lymphocytes do not express 
much human k or mouse k ; wildtype mouse (7777) lymphocytes 
express mouse k but lack human k . 

Fig. 51 shows the relative distribution of B cells 
expressing mouse X on the cell surface; 0011 mouse (7655) 
lymphocytes are positive for mouse X; 0001 mouse (7656) 
lymphocytes do not express significant mouse X; wildtype mouse 
(7777) lymphocytes express mouse X but at a relatively lower 
level than the 0011 mouse (7655) . 

Fig. 52 shows the relative distribution of B cells 
positive for endogenous mouse X as compared to human k 
(transgene-encoded) . The upper left panel shows the results 
of cells from a wildtype mouse possessing functional 
endogenous heavy and light chain alleles and lacking human 
transgene(s) ; the cells are positive for mouse lambda. The 
upper right panel shows cells from a mouse (#5822) having a k 
knockout background (JKD-f-h) and harboring the human k 
transgene intergration of the KCle-1399 line; the cells are 
positive for human k or mouse X in roughly proportional 
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amounts. The lower left panel shows cells from a mouse 
(#7132) having a k knockout background (JKD++) and harboring 
the human k transgene intergration of the KC2-1610 line; more 
cells are positive for mouse X than for human k, possibly 
indicating that the KC2-1610 transgene integration is less 
efficient than the KCle-1399 transgene integration. The lower 
right panel shows cells from a mouse harboring a human k 
minilocus transgene (KCo4) and lacking a functional endogenous 
murine k allele. The data presented in Fig. 52 also 
demonstrates the variability of phenotypic expression between 
transgenes. Such variability indicates the desirability of 
selecting for individual transgenes and/or transgenic lines 
which express one or more desired phenotypic features 
resulting from the integrated transgene (e.g., isotype 
switching, high level expression, low murine Ig background) . 
Generally, single or multiple transgene species (e.g., pKCle, 
pKC2, KCo4) are employed separately to form multiple 
individual transgenic lines differing by: (1) transgene, (2) 
site(s) of transgene integration, and/or (3) genetic 
background. Individual transgenic lines are examined for 
desired parameters, such as: (1) capability to mount an immune 
respone to a predetermined antigen, (2) frequency of isotype 
switching within transgene-encoded constant regions and/ or 
frequency of trans-switching to endogenous (e.g., murine) Ig 
constant region genes, (3) expression level of transgene- 
encoded immmunoglobulin chains and antibodies, (4) expression 
level of endogenous (e.g., murine) immunoglobulin 
immunoglobulin sequences, and (5) frequency of productive VDJ 
and VJ rearrangement. Typically, the transgenic lines which 
produce the largest concentrations of transgene-encoded (e.g., 
human) immunoglobulin chains are selected; preferably, the 
selected lines produce about at least 40 jLtg/ml of transgene- 
encoded heavy chain (e.g., human \i or human 7) in the serum of 
the transgenic animal and/or about at least 100 /xg/ml of 
transgene-encoded light chain (e.g., human k) . 

Mice were examined for their expression of human and 
murine immmunoglobulin chains in their unimmunized serum and 
in their serum following immunization with a specific antigen, 
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human CD4 . Fig- 53 shows the relative expression of human /x, 
human 7, murine /x, murine 7, human k , murine k , and murine X 
chains present in the serum of four separate unimmunized 0011 
mice of various genotypes (nt = not tested) ; human k 
predominates as the most abundant light chain , and human /x and 
murine 7 (putatively a product of trans-switching) are the 
most abundant heavy chains, with variability between lines 
present, indicating the utility of a selection step to 
identify advantageous genotypic combinations that minimize 
expression of murine chains while allowing expression of human 
chains. Mice #6907 and 7088 show isotype switching (cis- 
switching within the transgene) from human /x to human 7. 

Fig. 54 shows serum immunoglobulin chain levels for 
human /x (hu/x) , human 7 (hU7) , human k (hu/c) , murine /x (msjx) , 
murine 7 (ms7) , murine k (ms/c) , and murine X (msX) in mice of 
the various 0011 genotypes. 
Specific Antibody Response in 0011 Mice 

An 0011 mouse (#6295) was immunized with an 
immunogenic dose of human CD4 according to the following 
immunization schedule: Day 0, intraperitoneal injection of 100 
jtxl of CD4 mouse immune serum; Day 1, inject 2 0 jug of human CD 4 
(American Bio-Tech) on latex beads with DDA in 100 /xl; Day 15 
inject 2 0 /xg of human CD4 (American Bio-Tech) on latex beads 
with DDA in 100 /xl; Day 29 inject 20 fig of human CD4 (American 
Bio-Tech) on latex beads with DDA in 100 /xl; Day 4 3 inject 2 0 
/xg of human CD4 (American Bio-Tech) on latex beads with DDA in 
100 /xl. 

. Fig. 55 shows the relative antibody response to CD4 
immunization at 3 weeks and 7 weeks demonstrating the presence 
of human /x, human k , and human 7 chains in the anti-CD4 
response. Human 7 chains are present at significantly 
increased abundance in the 7 week serum, indicating that cis- 
switching within the heavy chain transgene (isotype switching) 
is occurring in a temporal relationship similar to that of 
isotype switching in a wildtype animal. 

Fig. 56 shows a schematic compilation of various 
human heavy chain and light chain transgenes. 
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EXAMPLE 28 

This example provides for the targeted knockout of 
the murine X light chain locus. 

Targeted Inactivation of the Murine Lambda Light Chain Locus 

Unlike the Ig heavy and kappa light chain loci, the 
murine V\J\ and CX gene segments are not grouped into 3 
families arranged in a 5 ' to 3 ' array, but instead are 
interspersed* The most 5 1 portion consists of two V segments 
(VX2 and VXX) which are followed, proceeding in a 3 ' 
direction, by two constant region exons, each associated with 
its own J segment (JX2CX2 and the pseudogene JX4CX4) . Next is 
the most extensively used V segment (VXl) which is followed by 
the second cluster of constant region exons (JX3CX3 and 
JXlCXl,). Overall the locus spans approximate 200 kb, with 
intervals of -2 0-90 kb between the two clusters. 

Expression of the lambda locus involves 
rearrangement of VX2 or VXX predominantly to JX2 and only 
rarely further 3 1 to JX3 or JXl. VXl can recombine with both 
JX3 and JXl- Thus the lambda locus can be mutated in order to 
fully eliminate recombination and expression of the locus. 

The distance between the two lambda gene clusters 
makes it difficult to inactivate expression of the locus via 
the generation of a single compact targeted deletion, as was 
used in inactivating the murine Ig heavy and kappa light chain 
loci. Instead, a small single deletion which would eliminate 
expression lambda light chains spans approximately 12 0 kb, 
extending from JX2CX2 to JXicXi (Fig. 57) . This removes all 
of the lambda constant region exons as well as the VXl gene 
segment , ensuring inactivation of the locus . 

Replacement type targeting vectors (Thomas and 
Capecchi (1987) op_ = _cit) are constructed in which the deleted 
120 kb is replaced with the selectable marker gene, neo, in a 
PGK expression cassette. The marker is embedded within 
genomic lambda sequences flanking the deletion to provide 
homology to the lambda locus and can also contain the RSV-tk 
gene, at the end of one of the regions of homology, to allow 
for enrichment for cells which have homologous ly integrated 
the vectors. Lambda locus genomic clone sequences are 
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obtained by screening of a strain 12 9/Sv genomic phage library 
isogenic to the ES line being targeted , since the use of 
targeting vectors isogenic to the chromosomal DNA being 
targeted has been reported to enhance the efficiency of 
homologous recombination* Targeting vectors are constructed 
which differ in their lengths of homology to the lambda locus. 
The first vector (vector 1 in Fig. 58) contains the marker 
gene flanked by total of approximately 8-12 kb of lambda locus 
sequences. For targeting events in which replacement vectors 
mediate addition or detection of a few kb of DNA this has been 
demonstrated to be a more than sufficient extent of homology 
(Hasty et al. (1991) op.cit : Thomas et al.(1992) op.cit. ) . 
Vectors with an additional approximately 4 0-60 kb of flanking 
lambda sequence are also constructed (vector 2 in Fig. 58) . 
Human Ig miniloci of at least 80 kb are routinely cloned and 
propagated in the plasmid vector pGPl (Taylor et al. (1993) 
op. cit ) . 

An alternative approach for inactivation of the 
lambda locus employs two independent mutations, for example 
mutations of the two constant region clusters or of the two V 
region loci, in the same ES cell. Since both constant regions 
are each contained within -6 kb of DNA, whereas one of the V 
loci spans -19 kb, targeting vectors are constructed to 
independently delete the J\2C\2/ J\4C\4 and the J\3C\3/ JXicXi 
loci. As shown in Fig. 58, each vector consists of a 
selectable marker (e.g., neo or pac) in a PGK expression 
cassette, surrounded by a total of -8-12 kb of lambda locus 
genomic DNA blanking each deletion. The HSV-tk gene can be 
added to the targeting vectors to enrich for homologous 
recombination events by positive-negative selection. ES cells 
are targeted sequentially with the two vectors, such that 
clones are generated which carry a deletion of one of the 
constant region loci; these clones are then targeted 
sequentially with the two vectors, such that clones will be 
generated which carry a deletion of one of the constant region 
loci, and these clones are then targeted to generate a 
deletion of the remaining functional constant region cluster. 
Since both targeting events are thus being directed to the 
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same cell, it is preferable to use a different selectable 
marker for the two targetings. In the schematic example shown 
in Fig. 58, one of the vectors contains the neo gene and the 
other the pac (puromycin N-acetyl transferase) gene* A third 
potential dominant selectable marker is the hyg (hygromycin 
phosphotransferase) gene. Both the pac and hyg genes can be 
been inserted into the PGK expression construct successfully 
used for targeting the neo gene into the Ig heavy and kappa 
light chain loci. Since the two lambda constant region 
clusters are tightly linked, it is important that the two 
mutations reside on the same chromosome. There preferably is 
a 50% probability of mutating the same allele by two 
independent targeting events, and linkage of the mutations is 
established by their co-segregation during breeding of 
chimeras derived from the doubly targeted ES cells. 

EXAMPLE 29 

This example provides for the targeted knockout of 
the murine heavy chain locus. 

Targeted Inactivation of the Murine Heavy Chain Locus 

A homologous recombination gene targeting transgene 
having the structure shown in Fig. 59 is used to delete at 
least one and preferably substantially all of the murine heavy 
chain locus constant region genes by gene targeting in ES 
cells. Fig. 59 shows a general schematic diagram of a 
targeting transgene. Segment (a) is a cloned genomic DNA 
sequence located upstream of the constant region gene(s) to be 
deleted (i.e, proximal to the J H genes); segment (b) comprises 
a positive selection marker, such as pgk-neo; segment (c) is a 
cloned genomic DNA sequence located downstream of the constant 
region gene(s) to be deleted (i.e, distal to the constan 
region gene(s) and and J H genes); and segment (d) , which is 
optional, comprises a negative selection marker gene (e.g., 
HSV-tk) . Fig. 60 shows a map of the murine heavy chain locus 
as taken from Immunoglobulin Genes , Honjo, T, Alt, FW, and 
Rabbits TH (eds.) Academic Press, NY (1989) p. 129. 
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A targeting transgene having a structure according 
to Fig. 59, wherein: (1) the (a) segment is the 11.5 kb insert 
of clone JH8.1 (Chen et al. (1993) Int. Immunol. 5: 647) or an 
equivalent portion comprising about at least 1-4 kb of 
5 sequence located upstream of the murine C/i gene, (2) the (b) 
segment is pgk-neo as described supra, (3) the (c) segment 
comprises the 1674 bp sequence shown in Fig. 61 or a 4-6 kb 
insert isolated from a phage clone of the mouse Ca gene 
isolated by screening a mouse genomic clone library with the 

10 end-labeled oligonucleotide having the sequence: 

5*-gtg ttg cgt gta tea get gaa acc tgg aaa cag ggt gac cag-3 1 
and (4) the (d) segment comprises the HSV-tic expression 
cassette described supra. 

Alternatively, a stepwise deletion of one or more 

15 heavy chain constant region genes is performed wherein a first 
targeting transgene comprises homology regions, i.e., segments 
(a) and (c) , homologous to sequences flanking a constant 
region gene or genes, a first species of positive selection 
marker gene (pgk-neo) , and an HSV-tJc negative selection 

20 marker. Thus, the (a) segment can comprise a sequence of at 

least about 1-4 kb and homologous to a region located upstream 
of C73 and the (c) segment can comprise a sequence of at least 
about 1-4 kb and homologous to a region located upstream of 
072a. This targeting transgene deletes the Cy3 , C7I, 072b, 

25 and 072a genes. This first targeting transgene is introduced 
into ES cells and correctly targeted recombinants are selected 
(e.g., with G418) , producing a correctly targeted C region 
deletion. Negative selection for loss of the HSV- t/c cassette 
is then performed (e.g., with ganciclovir or FIAU) . The 

3 0 resultant correctly targeted first round C deletion 

recombinants have a heavy chain locus lacking the C73, C7I, 
072b, and 072a genes. 

A second targeting transgene comprises homology 
regions, i.e., segments (a) and (c) , homologous to sequences 

35 flanking a constant region gene or genes, a second species of 
positive selection marker gene different that the first 
species (e.g., gpt or pac) , and an HSV-tfc negative selection 
marker. Thus, the (a) segment can comprise a sequence of at 
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least about 1-4 kb and homologous to a region located upstream 
of Ce and the (c) segment can comprise a sequence of at least 
about 1-4 kb and homologous to a region located upstream of 
Ca. This targeting transgene deletes the Ce and Ca genes. 

This second targeting transgene is introduced into 
the correctly targeted C-region recombinant ES cells obtained 
from the first targeting event. Cells which are correctly 
targeted for the second knockout event (i.e., by homologous 
recombination with the second targeting transgene) are 
selected for with a selection drug that is specific for the 
second species of positive selection marker gene (e.g., 
mycophenolic acid to select for gpt; puromycin to select for 
pac) . Negative selection for loss of the HSV-tk cassette is 
then performed (e.g., with ganciclovir or FIAU) . These 
resultant correctly targeted second round C region 
recombinants have a heavy chain locus lacking the Cy3 f Cyl, 
C72b, C72a, Ce , and Ca genes. 

Correctly targeted first-round or second-round 
recombinant ES cells lacking one or more C region genes are 
used for blastocyst injections as described (supra) and 
chimeric mice are produced. Germline transmission of the 
targeted heavy chain alleles is established, and breeding of 
the resultant founder mice is performed to generate mice 
homozygous for C-region knockouts. Such C-region knockout 
mice have several advantages as compared to J H knockout mice; 
for one example, C-region knockout mice have diminished 
ability (or completely lack the ability) to undergo trans- 
switching between a human heavy chain transgene and an 
endogenous heavy chain locus constant region, thus reducing 
the frequency of chimeric human/mouse heavy chains in the 
transgenic mouse. Knockout of the murine gamma genes is 
preferred, although p and delta are frequently also deleted by 
homologous targeting. C-region knockout can be done in 
conjunction with other targeted lesions int he endogenous 
murine heavy chain locus; a C-region deletion can be combined 
with a J H knockout to preclude productive VDJ rearrangement of 
the murine heavy chain locus and to preclude or reduce trans- 
switching between a human heavy chain transgene and the murine 
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heavy chain locus, among others. For some embodiments, it may 
be desirable to produce mice which specifically lack one or 
more C-region genes of the endogenous heavy chain locus, but 
which retain certain other C-region genes; for example, it may 
be preferable to retain the murine Ca gene to allow to 
production of chimeric human/mouse IgA by trans-switching, if 
such IgA confers an advantageous phenotype and does not 
substantially interfere with the desired utility of the mice. 

EXAMPLE 3 0 

This example demonstrates ex vivo depletion of 
lymphocytes expressing an endogenous (murine) immunoglobulin 
from a lymphocyte sample obtained from a transgenic mouse 
harboring a human transgene. The lymphocytes expressing 
murine Ig are selectively depleted by specific binding to an 
anti-murine immunoglobulin antibody that lacks substantial 
binding to human immunoglobulins encoded by the transgene (s) . 
Ex Vivo Depletion of Murine Ig-Expressinq B-cells 

A mouse homozygous for a human heavy chain minilocus 
transgene (HC2) and a human light chain minilocus transgene 
(KCo4) is bred with a C57BL/6 (B6) inbred mouse to obtain 2211 
mice (i.e., mice which: are homozygous for a functional 
endogenous murine heavy chain locus, are homozygous for a 
functional endogenous murine light chain locus, and which 
possess one copy of a human heavy chain transgene and one copy 
of a human light chain transgene) . Such 2211 mice also 
express B6 major and minor histocompatibility antigens. These 
mice are primed with an immunogenic dose of an antigen, and 
after approximately one week spleen cells are isolated. B 
cells positive for murine Ig are removed by solid phase- 
coupled antibody-dependent cell separation according to 
standard methods (Wysocki et al. (1978) Proc. Na tl. Acad. Sci. 
(U.S.A. ) 75 : 2844; MACS magnetic cell sorting, Miltenyi Biotec 
Inc., Sunnyvale, CA) , followed by antibody-dependent 
complement -mediated cell lysis ( Selected Methods in Cellular 
Immunology , Mishell BB and Shiigi SM (eds.), W.H. Freeman and 
Company, New York, 1980, pp. 211-212) to substantially remove 
residual cells positive for murine Ig. The remaining cells in 
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the depleted sample (e.g., T cells, B cells positive for human 
Ig) are injected i.v. , preferably together with additional 
anti-murine Ig antibody to deplete arising B cells, into a 
SCID/B6 or RAG/B6 mouse. The reconstitutued mouse is then 
further immunized for the antigen to obtain antibody and 
affinity matured cells for producing hybridoma clones. 

EXAMPLE 31 

Production of Fully Human Antibodies in Somatic Chimeras 

A method is described for producing fully human 
antibodies in somatic chimeric mice. These mice are generated 
by introduction of embryonic stem (ES) cells, carrying human 
immunoglobulin (Ig) heavy and light chain transgenes and 
lacking functional murine Ig heavy and kappa light chain 
genes, into blastocysts from RAG-1 or RAG- 2 deficient mice. 

RAG-1 and RAG-2 deficient mice (Mombaerts et al. 

(1992) Cell 68 : 869; Shinkai et al. (1992) Cell 68: 855) lack 
murine B and T cells due to an inability to initiate VDJ 
rearrangement and to assemble the gene segments encoding Igs 
and T cell receptors (TCR) . This defect in B and T cell 
production can be complemented by injection of wild-type ES 
cells into blastocysts derived from RAG-2 deficient animals. 
The resulting chimeric mice produce mature B and T cells 
derived entirely from the injected ES cells (Chen et al. 

(1993) Proc. Natl. Acad. Sci. USA 90: 4528). 

Genetic manipulation of the injected ES cells is 
used for introducing defined mutations and/or exogenous DNA 
constructs into all of the B and/or T cells of the chimeras. 
Chen et al. (1993), Proc. Natl. Acad. Sci. USA 90:4528-4532) 
generated ES cells carrying a homozygous inactivation of the 
Ig heavy chain locus, which, when injected into RAG 
blastocysts, produced chimeras which made T cells in the 
absence of B cells. Transfection of a rearranged murine heavy 
chain into the mutant ES cells results in the rescue of B cell 
development and the production of both B and T cells in the 
chimeras. 

Chimeric mice which express fully human antibodies 
in the absence of murine Ig heavy chain or kappa light chain 
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synthesis can be generated. Human Ig heavy and light chain 
constructs are introduced into ES cells homozygous for 
inactivation of both the murine Ig heavy and kappa light chain 
genes. The ES cells are then injected into blastocysts 
derived from RAG2 deficient mice. The resulting chimeras 
contain B cells derived exclusively from the injected ES cells 
which are incapable of expressing murine Ig heavy and kappa 
light chain genes but do express human Ig genes. 
Generation of ES cells Homozygous for Inactivation of the 
Immunoglobulin Heavy and Kappa Light Chain Genes 

Mice bearing inactivated Ig heavy and kappa light 
chain loci were generated by targeted deletion, in ES cells, 
of Ig J~ H and J K /C K sequences, respectively according to known 
procedures (Chen et al. (1993) EMBO J. 12: 821; and Chen et 
al. (1993) Int. Immunol, op.cit ^ . The two mutant strains of 
mice were bred together to generate a strain homozygous for 
inactivation of both Ig loci. This double mutant strain was 
used for derivation of ES cells. The protocol used was 
essentially that described by Robertson (1987, in 
Teratocarcinomas and Embryonic Stem Cells: A Practical 
Approach, p. 71-112, edited by E.J. Robertson, IRL Press). 
Briefly, blastocysts were generated by natural matings of 
homozygous double mutant mice. Pregnant females were 
ovariectomized on day 2.5 of gestation and the "delayed" 
blastocysts were flushed from the uterus on day 7 of gestation 
and cultured on feeder cells, to help maintain their 
undifferentiated state. Stem cells from the inner cell mass 
of the blastocysts, identifiable by their morphology, were 
picked, dissociated, and passaged on feeder cells. Cells with 
a normal karyotype were identified, and male cell lines will 
be tested for their ability to generate chimeras and 
contribute to the germ cells of the mouse. Male ES cells are 
preferable to female lines since a male chimera can produce 
significantly more offspring. 

Introduction of Human Ig Genes into Mouse Ig Hea vy and Kappa 
Light Chain Deficient ES cells 

Human immunoglobulin heavy and light chain genes are 
introduced into the mutant ES cells as either minilocus 
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constructs, such as HC2 and KC-C04, or as YAC clones, such as 
J1.3P. Transf ection of ES cells with human Ig DNAs is carried 
out by techniques such as electroporation or lipofection with 
a cationic lipid. In order to allow for selection of ES cells 
which have incorporated the human DNA, a selectable marker 
either is ligated to the constructs or is co-transf ected with 
the constructs into ES cells. Since the mutant ES cells 
contain the neomycin phosphotransf erse (neo) gene as a result 
of the gene targeting events which generated the Ig gene 
inactivations, different selectable markers, such as 
hygromycin phosphotransferase (hyg) or puromycin N-acetyl 
transferase (pac) , are used to introduce the human Ig genes 
into the ES cells. 

The human Ig heavy and light chain genes can be 
introduced simultaneously or sequentially, using different 
selectable markers, into the mutant ES cells. Following 
transf ection, cells are selected with the appropriate 
selectable marker and drug-resistant colonies are expanded for 
freezing and for DNA analysis to verify and analyze the 
integration of the human gene sequences. 
Generation of Chimeras 

ES clones containing human Ig heavy and light chain 
genes are injected into RAG-2 blastocysts as described 
(Bradley, A. (1987) , in Teratocarcinomas and Embryonic Stem 
Cells: A Practical Approach, p. 113-151, edited by E.J. 
Robertson, IRL Press) and transferred into the uteri of 
pseudopregnant females. Offspring are screened for the 
presence of human antibodies by ELISA assay of serum samples. 
Positive animals are used for immunization and the production 
of human monoclonal antibodies. 

EXAMPLE 3 2 

This example describes the introduction, via 
homologous recombination in ES cells, of a targeted frameshift 
mutation into the murine heavy chain locus leading to a 
deletion of B cells which undergo switch recombination. The 
frameshifted mice are suitable hosts for harboring non-murine 
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(e.g. , human) transgenes encoding human sequence 
immunoglobulins . 

The novel frameshifted mice can be used for 
expressing non-murine (e.g. , human) sequence immunoglobulins 
5 encoded by heavy chain transgene(s) and/or light chain 

transgene(s) , and for the isolation of hybridomas expressing 
class-switched, affinity matured, human sequence antibodies 
from introduced transgenes, among other uses. A frameshift is 
introduced into one of the four mouse JH gene segments and 

10 into the first exon of the mouse fj, gene. The two introduced 
frameshift mutations compensate for each other thus allowing 
for the expression of fully functional murine ^ heavy chain 
when a B cell uses the frameshifted JH for a functional VDJ 
joint. None of the other three JH segments can be used for 

15 functional VDJ joining because of the frameshift in /a, which 
is not compensated in the remaining JH genes. Alternatively, 
compensating frameshifts can be engineered into multiple 
murine JH genes . 

A mouse homozygous for a compensated, frameshifted 

2 0 immunoglobulin heavy chain allele has an approximately 

physiological level of peripheral B cells, and an 
approximately physiological level of serum IgM comprising both 
murine and human /x. However, B cells recruited into germinal 
centers frequently undergo a class switch to a non-/U isotype. 
25 Such a class switch in B cells expressing the endogenous 

murine ^ chain leads to the expression of a non-compensated 
frameshift mRNA, since the remaining non-ju C H genes do not 
possess a compensating frameshift. The resulting B cells do 
not express a B cell receptor and are deleted. Hence, B cells 

3 0 expressing a murine heavy chain are deleted once they reach 

the stage of differentiation where isotype switching occurs. 
However, B cells expressing heavy chains encoded by a non- 
murine (e.g., human) transgene capable of isotype switching 
and which does not contain such isotype-restrictive 
3 5 frameshifts are capable of further development, including 
isotype switching and/or affinity maturation, and the like. 

Therefore, the frameshifted mouse has an impaired 
secondary response with regard to murine heavy chain (/x) but a 



195 

significant secondary response with regard to transgene- 
encoded heavy chains. If a heavy chain transgene that is 
capable of undergoing class switching is introduced into this 
mutant background, the non-IgM secondary response is dominated 
by transgene expressing B cells. It is thus possible to 
isolate affinity matured human sequence immunoglobulin 
expressing hybridomas from these frameshifted mice. Moreover, 
the frameshifted mice generally possess immunoprotective 
levels of murine IgM, which may be advantageous where the 
human heavy chain transgene can encode only a limited 
repertoire of variable regions. 

For making hybridomas secreting human sequence 
monoclonal antibodies, transgenic mutant mice are immunized; 
their spleens fused with a myeloma cell line; and the 
resulting hybridomas screened for expression of the transgene 
encoded human non-ju isotype. Further, the frameshifted mouse 
may be advantageous over a JH deleted mouse because it will 
contain a functional /x switch sequence adjacent to a 
transcribed VDJ which serves as an active substrate for cis- 
switching (Gu et al. (1993) Cell 73: 1155); thus reducing the 
level of trans-switched B cells that express chimeric 
human/mouse antibodies . 
Construction of Frameshift Vectors 

Two separate frameshift vectors are built. One of 
the vectors is used to introduce 2 nucleotides at the 3 f end 
of the mouse J 4 gene segment, and one of the vectors is used 
to delete those same two nucleotides from the 5 1 end of exon 1 
of the mouse ju gene. 

1. JH vector. 

A 3.4 kb XhoI/EcoRI fragment covering the mouse 
heavy chain J region and the /x intronic enhancer is subcloned 
into a plasmid vector that contains a neomycin resistance gene 
as well as a herpes thymidine kinase gene under the control of 
a phosphoglycerate kinase promoter (tk/neo cassette; Hasty et 
al., (1991) Nature 350 ; 243). This clone is then used as a 
substrate for generating 2 different PCR fragments using the 
following oligonucleotide primers: 
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o-Al 5 1 

o-A2 5 1 

o-A3 5 1 

ggc etc -3 ' 
o-A4 5 1 



cca cac tct 
ggt gac tga 
ggt tac etc 

agg etc cac 



gca tgc tgc 
ggt acc ttg 
agt cac cgt 

cag acc tct 



aga age ttt 
acc cca gta 
etc etc aga 

eta gac age 



tct gta -3 1 
gtc cag -3 1 
ggt aag aat 

aac tac -3 1 



Oligonucleotides o-Al and o-A2 are used to amplify a 
1.2 kb fragment which is digested with SphI and Kpnl . 
Oligonucleotides o-A3 and o-A4 are used to amplify a 0.6 kb 
fragment which is digested with Kpnl and Xbal. These two 
digested fragments are then cloned into SphI/ Xbal digested 
plasmid A to produce plasmid B. 

Plasmid B contains the 2 nucleotide insertion at the 
end of the J 4 and, in addition, contains a new Kpnl site 
upstream of the insertion. The Kpnl site is used as a 
diagnostic marker for the insertion. 

Additional flanking sequences may be cloned into the 
5 1 Xhol site and the 3 1 EcoRI site of plasmid B to increase 
its homologous recombination efficiency. The resulting 
plasmid is then digested with SphI, or another restriction 
enzyme with a single site within the insert, and 
electroporated into embryonic stem cells which are then 
selected with G418 as described by Hasty et al. (1991) op. cit. 
Homologous recombinants are identified by Southern blot 
hybridization and then selected with FIAU as described by 
Hasty et al. to obtain deleted subclones which contain only 
the 2 base pair insertion and the new Kpnl site in JH4 . These 
are identified by Southern blot hybridization of Kpnl digested 
DNA and confirmed by DNA sequence analysis of PCR amplified 
JH4 DNA. 

The resulting mouse contains a JH4 segment that has 
been converted from the unmutated sequence: 

. . . TGGGGTCAAGGAACCTCAGTCACCGTCTCCTCAG gtaagaatggcctctcc . . . 

TrpGlyGlnGlyThrSerValThrVAlSerSerGlu 
to the mutant sequence: 

. . . TGGGGTCAAGGTACCTCAGTCACCGTCTCCTCAGAGgtaagaatggcctctcc . . . 
TrpGlyG 1 nG ly Thr S er Va 1 Thr VA 1 S er S er G lu 
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u Exon 1 Vector 

Using similar in vitro mutagenesis methodology 
described above to engineer a two base pair insertion into the 
JH4 gene segment, PCR products and genomic subclones are 
assembled to create a vector containing a two base pair 
deletion at the 5' end of the first jjl exon. In addition, to 
mark the mutation, a new XmnI site is also introduced 
downstream by changing an A to a G. 

The sequence of the unmutated /x gene is: 
. . . ctggtcctcagAGAGTCAGTCCTTCCCAAATGTCTTCCCCCTCGTC . . . 

GluSerGlnSerPheProAsnValPheProLeuVal 
The sequence of the mutated \i gene is: 

XmnI 

. . . ctggtcctcag AGTCAGTCCTTCCCGAATGTCTTCCCCCTCGTC . . . 

SerGlnSerPheProAsnValPheProLeuVal 
The homologous recombination vector containing the mutant 
sequence is linearized and electroporated into an ES cell line 
containing the JH4 insertion. Homologous recombinants are 
identified from neomycin-resistant clones. Those homologous 
recombinants that contain the frameshift insertion on the same 
chromosome as the JH4 insertion are identified by Southern 
blot hybridization of KpnI/BamHI digested DNA. The JH4 
insertion is associated with a new Kpnl site that reduces the 
size of the J-/i intron containing KpnI/BamHI fragment from the 
wild type 11.3 kb to a mutant 9 kb. The resulting clones are 
then selected for deletion of the inserted tk/neo cassette 
using FIAU. Clones containing the mutant /x exon are 
identified by Southern blot hybridization of XmnI digested 
DNA. The mutation is confirmed by DNA sequence analysis of 
PCR amplified [i exonl DNA. 
Generation of Frameshifted Mice 

The ES cell line containing both the two base pair 
insertion in JH4 , and the two base pair deletion in jjl exon 1, 
is then introduced into blastocyst stage embryos which are 
inserted into pseudopregnant females to generate chimeras. 
Chimeric animals are bred to obtain germline transmission, and 
the resulting animals are bred to homozygosity to obtain 
mutant animals homozygous for compensated frameshifted heavy 
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chain loci and having impaired secondary humoral immune 
responses in B cells expressing murine heavy chains. 

A human heavy chain transgene, such as for example 
pHCl or pHC2 and the like, may be bred into the murine heavy 
chain frameshift background by crossbreeding mice harboring 
such a human transgene into mice having the frameshifted 
murine IgH locus. Via interbreeding and backcrossing, mice 
homozygous at the murine IgH locus for /^-compensated 
frameshifted murine IgH alleles (i-e., capable of compensated 
in-frame expression of only murine /i and not murine non-/x 
chains) and harboring at least one integrated copy of a 
functional human heavy chain transgene (e.g., pHCl or pHC2) 
are produced. Such mice may optionally contain knockout of 
endogenous murine k and/ or X loci as described supra, and may 
optionally comprise a human or other non-murine light chain 
transgene (e.g., pKCle, pKC2 , and the like). 

Alternatively, the human transgene (s) (heavy and/or 
light) may comprise compensating frameshifts, so that the 
transgene J gene(s) contain a frameshift that is compensated 
by a frameshift in the transgene constant region gene(s) . 
Trans-switching to the endogenous constant region genes is 
uncompensated and produces a truncated or nonsense product; B 
cells expressing such uncompensated trans-switched 
immunoglobulins are selected against and depleted. 

EXAMPLE 3 3 

Endogenous Heavy Chain Inactivation bv D Re gion Ablation 

This example describes a positive-negative selection 
homologous recombination vector for replacing the mouse 
germline immunoglobulin heavy chain D region with a 
nonfunctional rearranged VDJ segment. The resulting allele 
functions within a B cell as a normal non-productive allele, 
with the allele undergoing intra-allele heavy chain class 
switching, thereby reducing the level of trans-switching to an 
active transgene locus. 
D Region Targeting Construct 

An 8-15 kb DNA fragment located upstream of the 
murine D region is isolated and subcloned from a mouse strain 
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129 phage library using an oligonucleotide probe comprising 
approximately 5 0 consecutive nucleotides of the published 
sequence for the DFL16.1 segment listed in GenBank. DFL16.1 
is the upstream D segment (i.e., proximal to the V region gene 
cluster and distal to the constant region gene cluster) . 

Similarly, a 9.5 kb BamHI fragment containing JH3 , 
JH4, E/z, S/if and the first two coding exons of the \jl constant 
region is isolated and subcloned from a mouse strain 129 
genomic phage library. 

A 5-10 kb rearranged VDJ is then isolated from a 
mouse hybridoma (any strain) and a synthetic linker containing 
a stop codon is inserted into the J segment. The stop linker 
within the J is preferable to an out-of-frame VDJ junction 
because of the possibility of V replacement rearrangements. 

These three fragments are assembled together with a 
PGKneo positive selection cassette and a PGKHSVtk negative 
selection cassette to form a positive-negative selection 
vector for eliminating the mouse D region in 129-derived ES 
cells (e.g., AB1) by homologous recombination. The targeting 
vector is formed by ligating the 8-15 kb DNA fragment to the 
positive selection cassette (e.g., PGKneo), which is itself 
ligated to the rearranged 5-10 kb rearranged VDJ, which is 
itself ligated to the 9.5 kb BamHI fragment; the negative 
selection cassette (e.g., PGKHSVtk) is then ligated at either 
end of the targeting construct. The construction of such a D 
region targeting vector is shown schematically in Fig. 63. 

The D region targeting construct is transferred into 
AB1 ES cells, positive and negative selection is performed as 
described above, and correctly targeted ES cells are cloned. 
The correctly targeted ES cell clones are used for blastocyst 
injections and chimeric mice are produced. The chimeric mice 
are bred to produce founder mice harboring a D-region 
inactivated heavy chain allele. Interbreeding of offspring is 
performed to produce homozygotes lacking a functional 
endogenous heavy chain locus. Such homozygotes are used to 
crossbreed to mice harboring human Ig transgenes (e.g., pHCl, 
pHC2, pKC2, pKCle, KCo4) to yield (by further backcrossing to 
the homozygotes lacking a functional D-region) mice lacking a 
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functional endogenous heavy chain locus and harboring a human 
heavy transgene (and preferably also a human light chain 
transgene) . In embodiments where some functional endogenous 
light chain loci remain (e.g., X loci), it is generally 
5 preferred that transgenes contain transcriptional control 
sequences that direct high level expression of human light 
chain (e.g., tc) polypeptides, and thus allow the transgene 
locus to compete effectively with the remaining endogenous 
light chain (e.g., X) loci. For example, the Co4 kappa light 
10 chain transgene is generally preferred as compared to pKCl 
with regard to the ability to compete effeectively with the 
endogenous X loci in the transgenic animal. 

EXAMPLE 3 4 

15 This example describes expansion of the human light 

chain transgene V gene repertoire by co-injection of a human k 
light chain minilocus and a yeast artificial chromosome 
comprising a portion of the human V/c locus. 

Introduction of Functional Human Light Chain V Segments by Co- 
2 0 Injection of V*-Containina YAC DNA and a k Minilocus 

An approximately 450 kb YAC clone containing part of 
the human V/c locus was obtained as a non-amplified YAC DNA 
from clone 4X17E1 of the publicly available ICRF YAC library 
(Larin et al. (1991) Proc. Natl. Acad. Sci. (U.S.A.) 88: 4123; 

2 5 Genome Analysis Laboratory, Imperial Cancer Research Fund, 

London, UK) . The 450 kb YAC clone was isolated without prior 
amplification by standard pulsed-field gel electrophoresis as 
per the manufacturer's specifications (CHEF DR-II 
electrophoresis cell, Bio-Rad Laboratories, Richmond, CA) . 

3 0 Six individual pulse field gels were stained with ethidium 

bromide and the gel material containing the YAC clone DNA was 
excised from the gel and then embedded in a new (low melting 
point agarose in standard gel buffer) gel cast in a triangular 
gel tray. The resulting triangular gel (containing the six 
3 5 excised YAC-containing gel blocks) was extended at the apex 
with a narrow agarose gel with 2 M NaOAc in addition to the 
standard electrophoresis buffer. The gel was then placed in 
an electrophoresis chamber immersed in standard gel buffer. 
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The Y-shaped gel former rises above the surface of the buffer 
so that current can only flow to the narrow high salt gel 
portion. A plexiglas block was placed over the high salt gel 
slice to prevent diffusion of the NaOAc into the buffer. The 
5 YAC DNA was then electrophoresed out of the original excised 
gel sliced (embedded) and into the narrow high salt gel 
portion. At the point of transition from the low salt gel to 
the high salt gel, there is a resistance drop that effectively 
halts the migration of the DNA at the apex of the triangular 
10 gel. 

Following electrophoresis and staining with ethidium 
bromide, the concentrated YAC DNA was cut away from the rest 
of the gel and the agarose was digested with GELase (Epicentre 
Technologies, Madison, Wisconsin) . Cesium chloride was then 

15 added to the resultant YAC-containing liquid to obtain a 

density of 1.68 g/ml. This solution was centrifuged at 37,000 
rpm for 36 hours to separate the YAC DNA from any 
contaminating material. 0.5 ml fractions of the resulting 
density gradient were isolated and the peak DNA fraction was 

20 dialyzed against 5 mM Tris (pH 7.4), 5 mM NaCl, 0.1 M EDTA. 

Following dialysis, the concentration of the resulting 0.65 ml 
solution of YAC DNA was found to contain 2 /zg/ml of DNA. This 
YAC DNA was mixed with purified DNA insert from plasmids pKClB 
and pKV4 at a ratio of 20:1:1 (micrograms YAC4xl7El:KClB:KV4) . 

25 The resulting 2 jug/ml solution was injected into the pronuclei 
of half -day B6CBF2 embryos, and 95 surviving microinjected 
embryos were transferred into the oviducts of pseudopregnant 
females. Twelve mice which developed from the microinjected 
embryos were born. 

30 

EXAMPLE 3 5 

This example describes class-switching, somatic 
mutation, and B cell development in immunized transgenic mice 
homozygous for an inactivated endogenous immunoglobulin locus 
35 and containing the HC1 or HC2 heavy chain transgene (s) . 

To demonstrate that a human sequence germline 
configuration minilocus can functionally replace the authentic 
locus, we bred a mouse strain lacking endogenous IgH with 
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strains containing human germline-conf iguration IgH 
transgenes. The two transgene miniloci, HC1 and HC2 , include 
one and four functional variable (V) segments respectively 10 
and 16 diversity (D) segments respectively, all six joining 
5 (JH) segments, and both the \x and 71 constant region segments* 
The miniloci include human cis-acting regulatory sequences — 
such as the JH— intronic enhancer and the \i and 7I switch 
sequences — that are closely linked to the coding segments. 
They also include an additional enhancer element derived from 

10 the 3 1 end of the rat IgH locus. We crossed HC1 and HC2 

transgenic mice with stem-cell derived mutant mice that lack 
JH segments (JHD mice) as described ( supra ) and cannot 
therefore undergo functional heavy chain rearrangements. The 
resulting transgenic-JHD mice contain B cells that are 

15 dependent on the introduced heavy chain sequences. 

Immunizations and hvbridomas . 

We immunized mice by intraperitoneal injections of 
50-100/xg of antigen. Antigens included human carcinoembryonic 

2 0 antigen (CEA; Crystal Chem, Chicago, IL) , hen eggwhite 

lysozyme (HEL; Pierce, Rockford, IL) , and keyhole limpet 
hemocyanin (KLH; Pierce, Rockford, IL) . For primary 
injections we mixed the antigen with complete Freund's 
adjuvant, for subsequent injections we used incomplete 

25 Freund's adjuvant (Gibco BRL, Gaithersburg, MD) . We fused 
spleen cells with the non-secreting mouse myeloma P3X63- 
Ag8.653 (ATCC, CRL1580) . We assayed serum samples and 
hybridoma supernatants for the presence of specific and non- 
specific antibody comprising human heavy chain sequences by 

30 ELISA. For detection of non-specific antibodies we coated 
microtiter wells with human heavy chain isotype specific 
antibody (mouse MAb a human IgGl, clone HP6069, Calbiochem, La 
Jolla, CA; mouse MAb a human IgM, clone CH6, The Binding Site, 
Birmingham, UK) and developed with peroxidase conjugated 

35 antisera (horseradish peroxidase conjugated affinity purified 
fab fragment from polyclonal goat a human IgG(fc) , cat # 109- 
036-098; affinity purified horseradish peroxidase conjugated 
polyclonal rabbit a human IgM(fc) , cat # 309-035-095. Jackson 
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Immuno Research, West Grove, PA) . For detection of antigen- 
specific antibodies we coated microtiter wells with antigen 
and developed with peroxidase-conjugated human heavy chain 
isotype specific antisera. We detected bound peroxidase by 
5 incubation with hydrogen peroxide and 

2,2' -Azino-bis- ( 3-Ethylbenzthiazoline-6-Sulf onic Acid, Sigma 
Chem. Co., St. Louis, MO). The reaction product is measured 
by absorption at 415 nm, and corrected for absorption at 
490 nm . 

10 

Flow cytometry . 

We prepared single cell suspensions from spleen, 
bone marrow, and peritoneal cavity, and lysed red cells with 
NH 4 C1, as described by Mishell and Shiigi. The lymphocytes 

15 are stained with the following reagents: Phycoerythrin 

conjugated anti-mouse Ig/c (clone X36; Becton Dickinson, San 
Jose, CA) , FITC conjugated anti-mouse IgD (clone SBA 1, 
Southern Biotech, AL) , FITC conjugated anti-mouse CDS (clone 
53-7.3; Becton Dickinson, San Jose, CA) , FITC conjugated anti- 

20 mouse IgX (clone R26-46; Pharmingen, San Diego, CA) , and Cy- 
Chrome conjugated anti-mouse B220 (clone RA3-6B2; Pharmingen, 
San Diego, CA) . We analyzed the stained cells using a FACScan 
flow cytometer and LYSIS II software (Becton Dickinson, San 
Jose, CA) . Most macrophages, neutrophils, and residual red 

25 cells are excluded by gating on forward and side scatter. 

Rescue of B cell compartment 

In the peritoneal cavity of HC1 transgenic-JHD 
animals we find normal levels of CD5 + B cells and 

3 0 approximately one-quarter the normal level of conventional 
CDS" B cells. The transgenic peritoneal CD5 + B cells are 
similar to the so-called B-l cells described in normal 
animals: they are larger than conventional B and T 
lymphocytes, they express lower levels of B22 0 than the 

35 conventional B cells found in the spleen, and they include a 
higher proportion of X light chain expressing cells. Over 90% 
of the splenic B cells express k, while up to 50% of the 
peritoneal B cells express X. Thus, while the level of 
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conventional B cells is uniformly reduced in all tissues, the 
level of B-l, which are reported to have a much greater 
capacity for self -renewal, appears to be normal in the HC1 
transgenic- JHD animals. 

Class switching . 

In transgenic- JHD mice, repeated exposure to antigen 
results in the production of human 7I antibodies as well as \i 
antibodies. We injected human CEA into transgenic- JHD mice at 
weekly intervals and monitored the serum levels of antigen- 
specific IgM and IgGl over a period of four weeks (Fig. 63). 
At one week there is a detectable IgM response but no IgGl 
response. However, the IgGl response is greater than the IgM 
response after two weeks, and it continues to increase while 
the IgM response remains relatively constant. This pattern — 
an initial IgM reaction followed by an IgG reaction — is 
typical of a secondary immune response; and it suggests that 
cis-acting sequences included in the transgene may be 
responding to cytokines that direct class switching. We have 
considered three possible mechanisms for expression of non-/x 
isotypes, each of which have been discussed in the literature. 
These mechanisms are: alternative splicing, which does not 
involve deletion of the \x gene; "<S-type" switching, which 
involved deletion of the \l gene via homologous recombination 
between flanking repeat sequences; and non-homologous 
recombination between switch regions. The results of our 
experiments, described below, are indicative of a switch 
region recombination model. 

Two types of non-deletional alternative splicing 
mechanisms can be invoked to explain an isotype shift. First, 
it is possible that a single transcript covering both /x and 7I 
is expressed from the transgene; this transcript could be 
alternatively spliced in response to cytokines induced by 
exposure to antigen. Alternative, a cytokine induced sterile 
transcript initiating upstream of 7I could be trans-spliced to 
the m transcript. If either of these mechanisms were 
responsible for the expression of human 71 sequences, then we 
would expect to be able to isolate hybridomas that express 



205 

both fj. and 71. However, although we have screened several 
hundred hyforidoiaas expressing either human m or human 7I, we 
have not found any such double producer , 7l + ) hybridomas. 
This indicates that expression of 7I is accompanied by 
5 deletion of the \i gene. 

Deletion of the \x gene can be mediated by non- 
homologous recombination between the m and 7I switch regions, 
or by homologous recombination between the two flanking 4 00 bp 
direct repeats (ofJL and £m) that are included in the HC1 and 

10 HC2 transgenes. Deletional recombination between cr/x and S/x 
has been reported to be responsible for the IgD + , IgM" 
phenotype of some human B cells. While the first mechanism, 
non-homologous switch recombination, should generate switch 
products of varying lengths, the second mechanism, a^z/S/i 

15 recombination, should always generate the same product. We 
performed a Southern blot analysis of genomic DNA isolated 
from three hybridomas (Fig. 64A) , one expressing m and two 
expressing 7I. We find genomic rearrangements upstream of the 
transgene 7I only in the two the 7I switch regions (Fig. 64B) . 

2 0 Furthermore, neither of the observed structures is compatible 

with homologous recombination between ofj. and Em- ° ur results 
are therefore consistent with a model for 7I isotype 
expression mediated by deletional non-homologous recombination 
between the transgene encoded \i and 7I switch regions. 

25 

Trans-switching . 

In addition to human 7I, we find mouse 7 in the 
serum of HC1 and HC2 transgenic- JHD mice. We have also 
obtained mouse 7 expressing hybridomas from these animals. 

3 0 Because the non-transgenic homozygous JHD animals do not 

express detectable levels of mouse immunoglobulins, we 
attribute the expression of mouse 7 in the HC1 and HC2 
transgenic-JHD animals to the phenomenon of trans-switching. 
All of the transgenic hybridomas that we have analyzed express 
35 either mouse or human constant region sequences, but not both. 
It is therefore unlikely that a trans-splicing mechanism is 
involved. We used PCR amplification to isolate cDNA clones of 
trans-switch products, and determined the nucleotide sequence 



of 10 of the resulting clones (Fig. 65), The 5' 
oligonucleotide in the PCR amplification is specific for the 
transgene encoded VH251, and the 3 1 oligonucleotide is 
specific for mouse 71, 72b, and 73 sequences. We find 
examples of trans-switch products incorporating all three of 
these mouse constant regions. 

Somatic mutation . 

Approximately 1% of the nucleotides within the 
variable regions of the trans-switch products shown in Fig. 7 
are not germline encoded. This is presumably due to somatic 
mutation. Because the mutated sequence has been translocated 
to the endogenous locus, the cis-acting sequences directing 
these mutations could be located anywhere 3 1 of the mouse 7 
switch. However, as we discuss below, we also observe somatic 
mutation in VDJ segments that have not undergone such 
translocations; and this result indicates that sequences 
required by heavy chain somatic mutation are included in the 
transgene . 

To determine if the HC1 and HC2 constructs include 
sufficient cis-acting sequences for somatic mutation to occur 
in the transgenic- JHD mice, we isolated and partially 
sequenced cDNA clones derived from two independent HC1 
transgenic lines and one HC2 line. We find that some of the 
71 transcripts from transgenic-JHD mice contain V regions with 
extensive somatic mutations. The frequency of these mutated 
transcripts appears to increase with repeated immunizations. 
Figs. 66A and 66B show two sets of cDNA sequences: one set is 
derived form an HC1 (line 2 6) transgenic-JHD mouse that we 
immunized with a single injection of antigen 5 days before we 
isolated RNA; the second set is derived from an HC1 (line 26) 
transgenic-JHD mouse that we hyper immunized by injecting 
antigen on three different days beginning 5 months before we 
isolated RNA; the second set is derived from an HC1 (line 26) 
transgenic-JHD mouse that we hyper immunized by injecting 
antigen on three different days beginning 5 months before we 
isolated RNA. Only 2 of the 13 V regions from the 5 day post- 
exposure mouse contain any non-germline encoded nucleotides. 
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Each of these V's contains only a single nucleotide change, 
giving an overall somatic mutation frequency of less than 0.1% 
for this sample. In contrast, none of the 13 V sequences from 
the hyper immunized animal are completely germline, and the 
5 overall somatic mutation frequency is 1.6%. 

Comparison of \i and 71 transcripts isolated from a 
single tissue sample shows that the frequency of somatic 
mutations is higher in transgene copies that have undergone a 
class switch. We isolated and partially sequenced 47 

10 independent /x and 71 cDNA clones from a hyper immunized CHI 

line 57 transgenic-JHD mouse (Fig. 67A and 67B) . Most of the 
jjl cDNA clones are unmodified relative to the germline 
sequence, while over half of the 71 clones contain multiple 
non-germline encoded nucleotides. The 71 expressing cells are 

15 distinct from the /x expressing cells and, while the two 
processes are not necessarily linked, class switching and 
somatic mutation are taking place in the same sub-population 
of B cells. 

Although we do not find extensive somatic mutation 

20 of the VH251 gene in non-hyperimmunized CHI transgenic mice, 
we have found considerable somatic mutation in VH56pl and 
VHSlpl genes in a naive HC2 transgenic mouse. We isolated 
spleen and lymph node RNA from an unimmunized 9 week old 
female HC2 transgenic animal. We individually amplified 7I 

25 transcripts that incorporate each of the four V regions in the 
HC2 transgene using V and 7I specific primers. The relative 
yields of each of the specific PCR products were 
VH56pl»VH51pl>VH4.21>VH251. Although this technique is not 
strictly quantitative, it may indicate a bias in V segment 

3 0 usage in the HC2 mouse. Fig. 68 shows 2 3 randomly picked 71 
cDNA sequences derived from PCR amplifications using an 
equimolar mix of all four V specific primers. Again we 
observe a bias toward VH56pl (19/23 clones) . In addition, the 
VH56pl sequences show considerable somatic mutation, with an 

35 overall frequency of 2.1% within the V gene segment. 

Inspection of the CDR3 sequences reveals that although 17 of 
the 19 individual VH56pl clones are unique, they are derived 
from only 7 different VDJ recombination events. It thus 
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ars that the VH56pl expressing B cells are selected, 
,aps by an endogenous pathogen or self antigen, in the 
*e animal. It may be relevant that this same gene is over- 
esented in the human fetal repertoire. 

iary 

Upstream cis-acting sequences define the 
rtionality of the individual switch regions, and are 
assary for class switching. Our observation — that class 
iching within the HC1 transgene is largely confined to 
_s involved in secondary response, and does not occur 
iomly across the entire B cell population — suggests that 

rrinimal sequences contained with the transgene are 
ziebient. Because the 7 sequences included in this 
struct begin only 116 nucleotides upstream of the start 
a<5bf the 7I sterile transcript, the switch regulatory 
id|i is compact. 

Our results demonstrate that these important cis- 
iijig regulatory elements are either closely linked to 
ividual 7 genes, or associated with the 3' heavy chain 
allfeer included in the HC1 and HC2 transgenes. Because the 

Sid HC2 inserts undergo transgene-autonomous class 
tahing — which can serve as a marker for sequences that are 
sly to have been somatically mutated — we were able to 
ily find hypermutated transcripts that did not originate 
m translocations to the endogenous locus. We found 
atically mutated 7 transcripts in three independent 
nsgenic lines (two HC1 lines and one HC2 line) . It is 
refore unlikely that sequences flanking the integration 
es of the transgene affect this process; instead, the 
nsgene sequences are sufficient to direct somatic mutation. 

EXAMPLE 3 6 

This example describes the generation of hybridomas 
)m mice homozygous for an inactivated endogenous 
mnoglobulin locus and containing transgene sequences 
coding a human sequence heavy chain and human sequence light 
iin. The hybridomas described secrete monoclonal antibodies 
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MA) and/ or CD4 obtained from NEN-DuPont. The ABT sample 
contained a purified 55 kD human CD4 molecule comprised the V x 
through V 3 domains of human CD4 . The recombinant human 
sequence CD4 (produced in CHO-K1 cells) was adsorbed to the 
5 assay plate and used to capture antibody from hybridoma 

supernatants , the captured antibodies were then evaluated for 
binding to a panel of antibodies which bind either human fi, 
human k, human y, murine /x, or murine k . 

One hybridoma was subcloned from its culture plate 
10 well, designated 1F2 . The 1F2 antibody bound to the ABT CD4 
preparation, was positive for human /i and human k, and was 
negative for human y, mouse y, and mouse k . 

Generation of Human la Monclonal Antibodies Derived from 

15 HC2 Transgenic Mice Immunized with Human CD4 and Human IcrE. 

The heavy chain transgene, HC2, is shown in Fig. 56 
and has been described supra (see, Example 34) . 

The human light chain transgene, KCo4 , depicted in 
Fig, 56 is generated by the cointegration of two individually 

20 cloned DNA fragments at a single site in the mouse genome. 

The fragments comprise 4 functional Vk segments, 5 J segments, 
the Ck exon, and both the intronic and downstream enhancer 
elements (see Example 21) (Meyer and Neuberger (1989) , EMBO J. 
8:1959-1964; Judde and Max (1992), Mol. Cell Biol. 12:5206- 

25 5216) . Because the two fragments share a common 3 kb sequence 
(see Fig. 56) , they can potentially integrate into genomic DNA 
as a contiguous 43 kb transgene, following homologous 
recombination between the overlapping sequences. It has been 
demonstrated that such recombination events frequently occur 

3 0 upon microinjection of overlapping DNA fragments (Pieper et 
al. (1992), Nucleic Acids Res. 20:1259-1264). Co-injected 
DNA 1 s also tend to co-integrate in the zygote, and the 
sequences contained within the individually cloned fragments 
would subsequently be jointed by DNA rearrangement during B 

3 5 cell development. Table 12 shows that transgene inserts from 
at least 2 of the transgenic lines are functional. Examples 
of VJ junctions incorporating each of the 4 transgene encoded 
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V segments, and each of the 5 J segments, are represented in 
this set of 3 6 clones ♦ 



Table 12 



line 




V/c65.8 


V«65.15 


i 

I V«65.3 


! Jxi 


! Jx2 


J*3 


J<4 


Jjc5 


#4436 


0 


IX 


4 


3 


14 


1 


0 


2 


1 


#4437 


1 


3 


7 


7 


S 


2 




7 


3 



Human light chain V and J segment usage in KCo4 transgenic 
♦.mice. The table shows the number of PCR clones, amplified from cDNA 
derived from two transgenic lines, which contain the indicated human kappa 
sequences. cDNA was synthesized using spleen RNA isolated from w 
individual KCo4 transgenic mice (mouse #8490, 3 mo., male, KCo4 line 
4437; mouse #8867, 2.5 mo., femaie. KCo4 line 4436). The cDNA was 
amplified bv PCR using a Ck specific oligonucleotide* 5TAG AAG GAA 
TTC AGC AGG CAC AC A AC A GAG GCA GTT CCA 3\ AND A 1:3 
mixture of the following 2 Vk specific oligonucleotides: 5* AGC TTC TCG 
AGC TCC TGC TGC f CT GTT TCC CAG GTG CC 3' and 5* CAG CTT 
CTC GAG CTC CTG CTA CTC TGG CTC (C,A)CA GAT ACC 3\ The 
PCR product was digested with Xhol and EcoRI, and cloned into a piasmid 
vector. Partial nucleotide sequences were determined by the dideoxy chain 
termination method for 18 randomly picked clones from each animal. The 
sequences of each clone were compared to the germiine sequence of the 
unrearranged transgene. 
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Twenty-three light chain minilocus positive and 18 
heavy chain positive mice developed from the injected embryos* 
These mice, and their progeny, were bred with mice containing 
targeted mutations in the endogenous mouse heavy (strain JHD) 
5 and k light chain loci (strain JCKD) to obtain mice containing 
human heavy and k light chain in the absence of functional 
mouse heavy and k light chain loci. In these mice, the only 
mouse light chain contribution, if any, is from the mouse X 
locus. 

10 Table 13 show that somatic mutation occurs in the 

variable regions of the transgene-encoded human heavy chain 
transcripts of the transgenic mice. Twenty-three cDNA clones 
from a HC2 transgenic mouse were partially sequenced to 
determine the frequency of non-germline encoded nucleotides 

15 within the variable region. The data include only the 

sequence of V segment codons 17-94 from each clone, and does 
not include N regions. RNA was isolated from the spleen and 
lymph node of mouse 5250 (HC2 line 2550 hemizygous, JHD 
homozygous) . Single-stranded cDNA was synthesized and y 

20 transcripts amplified by PCR as described [references]. The 
amplified cDNA was cloned into plasmid vectors, and 2 3 
randomly picked clones were partially sequenced by the dideoxy 
chain-termination method. The frequency of PCR- introduced 
nucleotide changes is estimated from constant region sequence 

25 as <0.2%. 

TABLE 13: The Variable Regions of Human y Transcripts in HC2 
Transgenic Mice Contain Non-Germline-Encoded Nucleotides 

Number of non- Frequency of non- 

30 VH Number of germline encoded germline-encoded 

Segment clones nucleotides nucleotides (%) 

VH251 0 

VH56P1 10 100 2.1 

VH51P1 1 5 2.0 

35 VH4.21 3 0 0.0 



Flow cytometry 
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We analyzed the stained cells using a FACScan 
flow cytometer and LYSIS II software (Becton Dickinson, San 
Jose, CA) . Spleen cells were stained with the following 
reagents: propidium iodide (Molecular Probes, Eugene, OR) , 
5 phycoerythrin conjugated a-human Ig/c (clone HP6062; Caltag, S. 
San Francisco, CA) , phycoerythrin conjugated a-mouse Ig/c (clone 
X3 6; Becton Dickinson, San Jose, CA) , FITC conjugated a-mouse 
IgX (clone R26-46; Pharmingen, San diego, CA) , FITC conjugated 
a-mouse Ig/i (clone R6-60.2; Pharmingen, San Diego, CA) , FITC 
10 conjugated a-human Ig/x (clone G2 0-127; Pharmingen, San Diego, 
CA) , and Cy-Chrome conjugated a-mouse B220 (clone RA3-6B2; 
Pharmingen, San Diego, CA) . 

Expression of human Iq transaenes 

15 Figure 69 shows a flow cytometric analysis of spleen cells from 
KCo4 and HC2 mice that are homozygous for both the JHD and JCKD 
mutations. The human sequence HC2 transgene rescued B cell 
development in the JHD mutant background, restoring the 
relative number of B2 2 0 + cells in the spleen to approximately 

20 half that of a wild type animal. These B cells expressed cell 
surface immunoglobulin receptors that used transgene encoded 
heavy chain. The human KCo4 transgene was also functional, and 
competed successfully with the intact endogenous X light chain 
locus. Nearly 95% of the splenic B cells in JHD/ JCKD 

25 homozygous mutant mice that contain both heavy and light chain 
human transgenes (double transgenic) expressed completely human 
cell surface IgM/c . 

Serum Ig levels were determined by ELISA done as 
follows: human fi: microtiter wells coated with mouse Mab a 

3 0 human IgM (clone CH6, The Binding Site, Birmingham, UK) and 
developed with peroxidase conjugated rabbit a human IgM(fc) 
(cat # 309-035-095, Jackson Immuno Research, West Grove, PA) . 
Human y: microtiter wells coated with mouse MAb a human IgGl 
(clone HP6069, Calbiochem, La Jolla, CA) and developed with 

35 peroxidase conjugated goat a human IgG(fc) (cat # 109-036-098, 
Jackson Immuno Research, West Grove, PA). Human k : microtiter 
wells coated with mouse Mab a human Ig/c (cat # 0173, AMAC, Inc. 
Ig/c (cat #A7164, Sigma Chem. Co., St. Louis, MO). Mouse 7: 
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microtiter wells coated with goat a mouse IgG (cat #115-006- 
071 , Jackson Imauno Research, West Grove, PA) . Mouse X: 
microtiter wells coated with rat MAb a mouse IgX (cat # 02171D, 
Pharmingen, San Diego, CA) and developed with peroxidase 
5 conjugated rabbit a mouse IgM(fc) (cat # 309-035-095, Jackson 
Immuno Research, West Grove, PA) . Bound peroxidase is detected 
by incubation with hydrogen peroxide and 2 , 2 1 -Azino-bis-) 3- 
Ethylbenzthiazoline-6-Sulf onic Acid, Sigma Chem. Co., St. 
Louis, MO) . The reaction product is measured by absorption at 
10 415 nm. 

The double transgenic mice also express fully 
human antibodies in the serum. Figure 70 shows measured serum 
levels of immunoglobulin proteins for 18 individual double 
transgenic mice, homozygous for endogenous heavy and kappa 
15 light chain inactivations , derived from several different 
transgenic founder animals. We found detectable levels of 
human ijl, yl, and /c. We have shown supra that the expressed 
human yl results from authentic class switching by genomic 
recombination between the transgene /x and 7I switch regions. 

2 0 Furthermore, we have found that intra-transgene class switching 

was accompanied by somatic mutation of the heavy chain variable 
regions. In addition to human immunoglobulins, we also found 
mouse 7 and X in the serum. The present of mouse X protein is 
expected because the endogenous locus is completely intact. We 

25 have shown elsewhere that the mouse 7 expression is a 

consequence of trans-switch recombination of transgene VDJ 
segments into the endogenous heavy chain locus. This trans- 
switching phenomenon, which was originally demonstrated for 
wild-type heavy chain alleles and rearranged VDJ transgenes 

30 (Durdik et al. (1989), Proc. Natl. Acad. Sci. USA 86:2346-2350; 
Gerstein et al. (1990), Cell 63:537-548), occurs in the mutant 
JHD background because the downstream heavy chain constant 
regions and their respective switch elements are still intact. 

The serum concentration of human IgM/c in the 

3 5 double transgenic mice was approximately 0.1 mg/ml, with very 

little deviation between animals or between lines. However, 

human 7I, mouse 7, and mouse X levels range from 0.1 to 

10 micrograms/ml. The observed variation in 7 levels between 
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individual animals may be a consequence of the fact that 7 is 
an inducible constant region. Expression presumably depends on 
factors such as the health of the animal, exposure to antigens, 
and possibly MHC type. The mouse X serum levels are the only 
5 parameter that appears to correlate with individual transgenic 
lines* KCo4 line 4436 mice which have the fewest number of 
copies of the transgene per integration (approximately 1-2 
copies) have the highest endogenous X levels, while KCo4 line 
4437 mice ("10 copies per integration) have the lowest X 

10 levels. This is consistent with a model in which endogenous X 
rearranges subsequent to the k transgene, and in which the 
serum X level is not selected for, but is instead a reflection 
of the relative size of the precursor B cell pool. Transgene 
loci containing multiple light chain inserts may have the 

15 opportunity to undergo more than one V to J recombination 

event, with an increased probability that one of them will be 
functional. Thus high copy lines will have a smaller pool of 
potential X cells. 

20 Immunizations with human CD4 and IqE 

To test the ability of the transgenic B cells to 
participate in an immune response, we immunized double 
transgenic mice with human protein antigens, and measured serum 
levels of antigen specific immunoglobulins by ELISA. Mice were 

25 immunized with 50 /xg recombinant sCD4 (cat. # 013101, American 
Bio-Technologies Inc., Cambridge, MA) covalently linked to 
polystyrene beads (cat # 08226, Polysciences Inc., Warrington, 
PA) in complete Freund*s adjuvant by intraperitoneal injection. 
Each of the mice are homozygous for disruptions of the 

3 0 endogenous /x and k loci, and hemizygous for the human heavy 

chain transgene HC2 line 2500 and human k light chain transgene 

KCo4 line 4437. 

Methods 

Serum samples were diluted into microtiter wells 
35 coated with recombinant sCD4 . Human antibodies were detected 
with peroxidase conjugated rabbit a human IgM(fc) (Jackson 
Immuno Research, West Grove, PA) or peroxidase conjugated goat 
anti-human Ig/c (Sigma, St. Louis, MO) . 
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Figure 71A shows the primary response of 
transgenic mice immunized with recombinant human soluble CD4 . 
All four of the immunized animals show an antigen-specific 
human IgM response at one week. The CD4-specific serum 
5 antibodies comprise both human /x heavy chain and human k light 
chain. 

To evaluate the ability of the HC2 transgene to 
participate in a secondary response, we hyperimmunized the 
transgenic mice by repeated injection with antigen, and 
10 monitored the heavy chain isotype of the induced antibodies. 
Mice homozygous for the human heavy chain transgene HC2 and 
human k light chain transgene KCo4 were immunized with 2 5 /zg of 
human IgE/c (The Binding Site, Birmingham, UK) in complete 
Freund's adjuvant on day = 0. Thereafter, animals were 
15 injected with IgE/c in incomplete Freund's adjuvant at 

approximately weekly intervals. Serum samples were diluted 
1:10, and antigen-specific ELISAs were performed on human IgE, 
X coated plates. 

Figure 7 IB shows a typical time course of the 

2 0 immune response from these animals: we injected double 

transgenic mice with human IgE in complete Freund's adjuvant, 
followed by weekly boosts of IgE in incomplete Freund's 
adjuvant. The initial human antibody response was IgM/c, 
followed by the appearance of antigen specific human IgG/c . The 
25 induced serum antibodies in these mice showed no cross- 
reactivity to human IgM or BSA. The development, over time, of 
a human IgG 

We have also tested the ability of the heavy 
chain transgene to undergo class switching in vitro: splenic B 

3 0 cells purified form animals hemizygous for the same heavy chain 

construct (HC2, line 2550) switch from human IgM to human IgGl 
in the presence of LPS and recombinant mouse IL-4 . However, in 
vitro switching did not take place in the presence of LPS and 
recombinant mouse IL-2, or LPS alone. 
35 We find human IgM-expressing cells in the spleen, 

lymph nodes, peritoneum, and bone marrow of the double- 
transgenic/double-knockout (0011) mice. Although the 
peritoneal cavity contains the normal number of B cells, the 
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absolute number of transgenic B cells in the bone marrow and 
spleen is approximately 10-50% of normal. The reduction may 
result from a retardation in transgene-dependent B cell 
development. The double-transgenic/ double-knockout (0011) mice 
5 also express fully human antibodies in the serum, with 

significant levels of human ji, yl, and k in these mice. The 
expressed human 71 results from authentic class switching by 
genomic recombination between the transgene fi and 71 switch 
regions. Furthermore, the intratransgene class switching is 

10 accompanied by somatic mutation of the heavy chain variable 
regions encoded by the transgene. In addition to human 
immunoglobulins, we find mouse }i and mouse X in these mice. 
The mouse \i expression appears to be a result of trans- 
switching recombination, wherein transgene VDJ gene is 

15 recombined into the endogenous mouse heavy chain locus. Trans- 
switching, which was originally observed in the literature for 
wild-type heavy chain alleles and rearranged VDJ transgenes, 
occurs in our J H ~^" background because the mouse downstream 
heavy chain constant regions and their respective switch 

20 elements are still intact. 

To demonstrate the ability of the transgenic B 
cells to participate in an immune response, we immunized the 
0011 mice with human protein antigens, and monitored serum 
levels of antigen-specific immunoglobulins. The initial human 

25 antibody response is IgM, followed by the expression of 

antigen-specific human IgG (Fig. 7 IB and Fig. 73) . The lag 
before appearance of human IgG antibodies is consistent with an 
association between class-switching and a secondary response to 
antigen. 

30 In a transgenic mouse immunized with human CD4 , 

human IgG reactivity to the CD4 antigen was detectable at serum 
concentrations ranging from 2 x 10" 2 to 1.6 x 10" 4 . 

Identification of Anti-Human CD4 Hybridomas 
35 A transgenic mouse homozygous for the human heavy 

chain transgene HC2 and human k light chain transgene KCo4 were 
immunized with 2 0 /ig of recombinant human CD4 in complete 
Freund's adjuvant on day 0. Thereafter, animals were injected 
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with CD4 in incomplete Freund's adjuvant at approximately 
weekly intervals. Fig. 73 shows human antibody response to 
human CD4 in serum of the transgenic mouse. Serum samples were 
diluted 1:50, and antigen-specific ELISAs were performed on 
human CD4 coated plates. Each line represents individual 
sample determinations. Solid circles represent IgM, open 
squares represent IgG . 

We also isolated hybridoma cell lines from one of 
the mice that responded to human CD4 immunization. Five of the 
cloned hybridomas secrete human IgG* (human 71/human k) 
antibodies that bind to recombinant human CD4 and do not 
crossreact (as measured by ELISA) with a panel of other 
glycoprotein antigens. The association and dissociation rates 
of the immunizing human CD4 antigen for the monoclonal 
antibodies secreted by two of the IgG* hybridomas, 4E4.2 and 
2C5.1, were determined. The experimentally-derived binding 
constants (K a ) were approximately 9 x 10 7 M" 1 and 8 x 10 7 M" 1 
for antibodies 4E4.2 and 2C5.1, respectively. These K a values 
fall within the range of murine IgG anti-human CD4 antibodies 
that have been used in clinical trials by others (Chen et al. 
(1993) Int. Immunol. 6: 647). 

A mouse of line #7494 (0012 ;HC1- 
2 6+; JHD++ ; JKD++ ; KC2 - 1 6 1 0 ++ ) was immunized on days 0, 13, 20, 
28, 33, and 47 with human CD4 , and produced anti-human CD4 
antibodies comprised of human k and human /x or 7. 

By day 28, human y. and human k were found present 
in the serum. By day 47, the serum response against human CD4 
comprised both human /x and human 7, as well as human /c. On day 
50, splenocytes were fused with P3X63-Ag8 . 653 mouse myeloma 
cells and cultured. Forty-four out of 700 wells (6.3%) 
contained human 7 and/or k anti-human CD4 monoclonal 
antibodies. Three of these wells were confirmed to contain 
human 7 anti-CD4 monoclonal antibodies, but lacked human k 
chains (presumably expressing mouse X) . Nine of the primary 
wells contained fully human IgM/c anti-CD4 monoclonal 
antibodies, and were selected for further characterization. 
One such hybridoma expressing fully human IgM/c anti-CD4 
monoclonal antibodies was designated 2C11-8. 



Primary hybridomas were cloned by limiting 
dilution and assessed for secretion of human fi and k monoclonal 
antibodies reactive against CD4 . Five of the nine hybridomas 
remained positive in the CD4 ELISA. The specificity of these 
human IgM/c monoclonal antibodies for human CD4 was demonstrated 
by their lack of reactivity with other antigens including 
ovalbumin, bovine serum albumin, human serum albumin, keyhole 
limpet hemacyanin, and carcinoembryonic antigen. To determine 
whether these monoclonal antibodies could recognize CD 4 on the 
surface of cells (i.e., native CD4 ) , supernatants from these 
five clones were also tested for reactivity with a CD4 + T cell 
line, Sup Tl. Four of the five human IgM/c monoclonal 
antibodies reacted with these CD44- cells. To further confirm 
the specificity of these IgM/c monoclonal antibodies, freshly 
isolated human peripheral blood lymphocytes (PBL) were stained 
with these antibodies. Supernatants from clones derived from 
four of the five primary hybrids bound only to CD4+ lymphocytes 
and not to CD8+ lymphocytes (Figure 72) . 

Fig. 72 shows reactivity of IgM/c anti-CD4 
monoclonal antibody with human PBL. Human PBL were incubated 
with supernatant from each clone or with an isotype matched 
negative control monoclonal antibody, followed by either a 
mouse anti-human CD4 monoclonal antibody conjugated to PE (top 
row) or a mouse anti-human CD8 Ab conjugated to FITC (bottom 
row) . Any bound human IgM/c was detected with a mouse anti- 
human fM conjugated to FITC or to PE, respectively. 
Representative results for one of the clones, 2C11-8 (right 
side) and for the control IgM/c (left side) are shown. As 
expected, the negative control IgM/c did not react with T cells 
and the goat anti-human n reacted with approximately 10% of 
PBL, which were presumably human B cells. 

Good growth and high levels of IgM/c anti-CD4 
monoclonal antibody production are important factors in 
choosing a clonal hybridoma cell line for development. Data 
from one of the hybridomas, 2C11-8, shows that up to 5 
pg/cell/d can be produced (Figure 74) . Similar results were 
seen with a second clone. As is commonly observed, production 
increases dramatically as cells enter stationary phase growth. 
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Fig. 74 shows cell growth and human IgM/c anti-CD4 monoclonal 
antibody secretion in small scale cultures. Replicate cultures 
were seeded at 2xl0 5 cells/ml in a total volume of 2 ml. Every 
twenty-four hours thereafter for four days, cultures were 
5 harvested. Cell growth was determined by counting viable cells 
and IgM/c production was quantitated by an ELISA for total human 
11 (top panel) . The production per cell per day was calculated 
by dividing the amount of IgM/c by the cell number (bottom 
panel) . 

10 Fig. 75 shows epitope mapping of a human IgM/c 

anti-CD4 monoclonal antibody. Competition binding flow 
cytometric experiments were used to localize the epitope 
recognized by the IgM/c anti-CD4 monoclonal antibody, 2C11-8. 
For these studies, the mouse anti-CD4 monoclonal antibodies, 

15 Leu3a and RPA-T4 , which bind to unique, nonoverlapping epitopes 
on CD4 were used. PE fluorescence of CD4+ cells preincubated 
with decreasing concentrations of either RPA-TA or Leu-3a 
followed by staining with 2C11-8 detected with PE-conjugated 
goat anti-human IgM. IThere was concentration-dependent 

2 0 competition for the binding of the human IgM/c anti-CD4 

monoclonal antibody 2C11-8 by Leu3a but not by RPA-T4 (Figure 
75) . Thus, the epitope recognized by 2C11-8 was similar to or 
identical with that recognized by monoclonal antibody Leu3a, 
but distinct from that recognized by RPA-T4 . 
25 In summary, we have produced several hybridoma 

clones that secrete human IgM/c monoclonal antibodies that 
specifically react with native human CD4 and can be used to 
discriminate human PBLs into CD4 + and CD4" subpopulations . At 
least one of these antibodies binds at or near the epitope 

3 0 defined by monoclonal antibody Leu3a. Monoclonal antibodies 

directed to this epitope have been shown to inhibit a mixed 
leukocyte response (Engleman et al., J. Exp. Med. (1981) 
153:193). A chimeric version of monoclonal antibody Leu3a has 
shown some clinical efficacy in patients with mycosis fungoides 
35 (Knox et al. (1991) Blood 77:20). 

We have isolated cDNA clones from 3 different 
hybridoma cell lines (2C11.8, 2C5.1, and 4E4.2), and have 
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determined the partial nucleotide sequence of some of the 
expressed inununoglobuhn genes in each of these cell lines. 
For sequence analysis, total RNA was isolated from 
approximately 5 x 10 6 hybridoma cells. sscDNA was synthesized 
5 by priming reverse transcription with oligo dT. A portion of 
this sscDNA was used in duplicate PCR reactions primed by a 
pool of oligos with specificities for either (i) heavy chain 
variable framework regions contained within the HC1 or HC2 
transgenes and a single downstream oligo specific for constant 

10 human gamma sequence, or (ii) light chain variable framework 

regions contained within the KC2 or KCo4 transgene and a single 
downstream oligo specific for constant human kappa sequence. 
Products from these PCR reactions were digested with 
appropriate restriction enzymes, gel purified, and 

15 independently cloned into pNN03 vector. DNA was isolated and 
manual dideoxy and/or automated fluorescent sequencing 
reactions performed on dsDNA. 

The characteristics of the three hybridomas, 
2C11.8, 2C5.1, and 4E4.2, are given below in Table 11. 
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Table 11 Human variable region usage in hybridomas 





Subclone 


Specificity 


Isotype 


Vh 


Dh 


Jh 


Vjc 


Jk 


25 


2C1L8 


nCD4 


IgMic 


251 


nd.* 


nd. 


nd. 


nd. 




2C5.1 


rCD4 


igG* 


251 


HQ52 


JHS 


65.15 


JK4 




4E4.2 


rCD4 


IgG* 


251 


HQ52 


JHS 


65.15 


JK4 



* n.d., not determined 



3 0 Nucleotide sequence analysis of expressed heavy and 

light chain sequences from the two IgG/c hybridomas 2C5.1 and 
4E4.2 reveal that they are sibling clones derived from the same 
progenitor B cell. The heavy and light chain V(D)J junctions 
from the two clones are identical, although the precise 

3 5 nucleotide sequences differ by presumptive somatic mutations. 
The heavy chain VDJ junction sequence is: 



VH251 N DHQ52 JH5 

TAT TAC TGT GCG AG (g get cc) A ACT GGG GA C TGG TTC GAC 
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YYCAR AP TGD WFD 

The light chain VJ junction is: 

5 Vk65.15 N Jk4 

TAT AAT AGT TAC CCT CC (t) ACT TTC GGC 

YNSYPP TFG 

The following non-germline encoded codons were identified 
10 (presumptive somatic mutations) : 

2CS . 1 heavy chain AGC->AGG S28R (replacement) 
light chain CCG->ACG P119T (replacement) 

4E4 . 2 heavy chain AGC->AGG S28R (replacement) 

15 CTG->CTA L80L (silent) 

light chain GAG->GAC E4 ID ( replacement ) 

AGG->AAG R61K (replacement) 

CCG->ACG P119T (replacement) 

2 0 We conclude that these two gamma hybridomas are 

derived from B cells that have undergone a limited amount of 
somatic mutation. This data shows that the HC2 transgenic 
animals use the VH5-51 (aka VH2 51) V segment* We have 
previously shown that VH4-34, VH1-69, and VH3-30.3 are 
25 expressed by these mice. The combination of these results 
demonstrates that the HC2 transgenic @ce express all four of 
the transgene encoded human VH genes. 

We conclude that human immunoglobulin-expressing B 
cells undergo development and respond to antigen in the context 

3 0 of a mouse immune system. Antigen responsivity leads to 

immunoglobulin heavy chain isotype switching and variable 
region somatic mutation. We have also demonstrated that 
conventional hybridoma technology can be used to obtain 
monoclonal human sequence antibodies from these mice. 
3 5 Therefore, these transgneic mice represent a source of human 
antibodies against human target antigens . 



EXAMPLE 37 
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This example describes the generation of transgenic 
mice homozygous for an inactivated endogenous heavy chain and k 
chain locus and harboring a transgene capable of isotype 
switching to multiple downstream human C H genes. The example 
5 also demonstrates a cloning strategy for assembling large 

transgenes (e.g., 160 kb) by co-microinjection of multiple DNA 
fragments comprising overlapping homologous sequence joints 
(see Fig. 76) , permitting construction of a large transgene 
from more than two overlapping fragments by homologous 

10 recombination of a plurality of homology regions at distal ends 
of the set of fragments to be assembled in vivo . such as in a 
microinjected ES cell or its clonal progeny. The example also 
shows, among other things, that isolated lymphocytes from the 
transgenic animals can be induced to undergo isotype switching 

15 in vitro , such as with IL-4 and LPS. 

A set of five different plasmid clones was 
constructed such that the plasmid inserts could be isolated, 
substantially free of vector sequences; and such that the 
inserts together form a single imbricate set of overlapping 

20 sequence spanning approximately 150 kb in length. This set 

includes human V, D, J, jjl, y3 , and 7I coding sequences, as well 
as a mouse heavy chain 3 1 enhancer sequence. The five clones 
are, in 5' to 3' order: pH3V4D, pCORlxa, pll-14, pPl-57 0, and 
pHP-3a (Fig. 7 6) . Several different cloning vectors were used 

25 to generate this set of clones. Some of the vectors were 
designed specifically for the purpose of building large 
transgenes. These vectors (pGPla, pGPlb, pGPlc, pGPld, pGPlf , 
pGP2a, and pGP2b) are pBR322-based plasmids that are maintained 
at a lower copy number per cell than the pUC vectors 

30 (Yanisch-Perron et al. (1985) Gene 33 ; 103-119). The vectors 

also include trpA transcription termination signals between the 
polylinker and the 3' end of the plasmid ^-lactamase gene. The 
polylinkers are flanked by restriction sites for the 
rare-cutting enzyme NotI; thus allowing for the isolation of 

35 the insert away from vector sequences prior to embryo 

microinjection. Inside of the NotI sites, the polylinkers 
include unique Xhol and Sail sites at either end. The pGPl 
vectors are described in Taylor et al. (1992) Nucleic Acids 
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Res. 2_3: 6287. To generate the pGP2 vectors, pGPlf was first 
digested with AlwNI and ligated with the synthetic 
oligonucleotides o-236 and o-237 (o-236, 5 1 - ggc gcg cct tgg 
cct aag agg cca -3 1 ; o-237, 5'- cct ctt agg cca agg cgc gcc tgg 
5 -3 1 ) The resulting plasmid is called pGP2a. Plasmid pGP2a was 
then digested with Kpnl and EcoRI, and ligated with the 
oligonucleotides o-288 and o-289 (o-288, 5 1 - aat tea gta teg 
atg tgg tac -3 f ; o-289, 5'- cac ate gat act g -3 1 ) to create 
pGP2b (Figs. 77A and Fig. 77B) . 

10 The general scheme for transgene construction with 

the pGP plasmids is outlined in Fig. 78 (paths A and B) . All 
of the component DNA fragments are first cloned individually in 
the same 5' to 3* orientation in pGP vectors. Insert NotI, 
Xhol and Sail sites are destroyed by oligonucleotide 

15 mutagenesis or if possible by partial digestion, polymerase 
fill-in, and blunt end ligation. This leaves only the 
polylinker derived Xhol and Sail sites at the 5 ' and 3 1 ends of 
each insert. Individual inserts can then be combined stepwise 
by the process of isolating Xhol/Sall fragments from one clone 

2 0 and inserting the isolated fragment into either the 5' Xhol or 

3 1 Sail site of another clone (Fig. 78, path A). Transf ormants 
are then screened by filter hybridization with one or more 
insert fragments to obtain the assembled clone. Because 
Xhol /Sail joints cannot be cleaved with either enzyme, the 
25 resulting product maintains unique 5' Xhol and 3 1 Sail sites, 
and can be used in the step of the construction. A variation 
of this scheme is carried out using the vectors pGP2a and pGP2b 
(Fig. 78, path B) . These plasmids includes an Sfil site 
between the ampicillin resistance gene and the plasmid origin 

3 0 of replication. By cutting with Sfil and Xhol or Sail, inserts 

can be isolated together with either the drug resistance 
sequence or the origin of replication. One Sfil/Xhol fragment 
is ligated to one Sfil /Sail fragment in each step of the 
synthesis. There are three advantages to this scheme: (i) 
3 5 background transf ormants are reduced because sequences from 
both fragments are required for plasmid replication in the 
presence of ampicillin; (ii) the ligation can only occur in a 
single 5' to 3' orientation; and (iii) the Sfil ends are not 
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self -compatible, and are not compatible with Sail or Xhol, thus 
reducing the level of non-productive ligation. The 
disadvantage of this scheme is that insert Sfil sites must be 
removed as well as NotI, Xhol, and Sail sites. These medium 
5 copy vectors are an improvement over the commonly used pUC 
derived cloning vectors. To compare the ability of these 
vectors to maintain large DNA inserts, a 43 kb Xhol fragment 
comprising the human JH/Cjz region was ligated into the Sail 
site of pSP72 (Promega, Madison, WI) , pUC19 (BRL, Grand Island, 

10 NY) , and pGPlf . Transformant colonies were transferred to 

nitrocellulose and insert containing clones were selected by 
hybridization with radiolabeled probe. Positive clones were 
grown overnight in 3 ml media and DNA isolated: EcoRI digestion 
of the resulting DNA reveals that all the pSP72 and pUC19 

15 derived clones deleted the insert (Fig. 79) ; however, 12 of the 
18 pGPlf derived clones contained intact inserts. Both 
orientations are represented in these 12 clones. 

The construction and isolation of the f ive clones 
(pH3V4D, pCORlxa, pll-14, pPl-570, and pHP-3a) used to generate 

20 the HCo7 transgene is outlined below. 

pH3V4D. 

Germline configuration heavy chain variable gene 
segments were isolated from phage 1 genomic DNA libraries using 
25 synthetic oligonucleotide probes for VH1 and VH3 classes. The 
VH1 class probe was o-49: 

5 1 - gtt aaa gag gat ttt att cac ccc tgt gtc etc tec aca ggt gtc 
-3 • 

30 The VH3 class probe was o-184: 

5'- gtt tgc agg tgt cca gtg t(c,g)a ggt gca get g(g,t)t gga gtc 
(t,c) (g,c)g -3' 

Positively hybridizing clones were isolated, 
35 partially restriction mapped, subcloned and partially 

sequenced. From the nucleotide sequence it was determined that 
one of the VH1 clones isolated with the o-49 probe encoded a VH 
gene segment, 49.8, comprising an amino acid sequence identical 
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to that contained in the published sequence of the hvl2 63 gene 
(Chen et al. (1989) Arthritis Rheum. 32: 72) . Three of the VH3 
genes, 184.3, 184.14, and 184.17, that were isolated with the 
o-184 probe contained sequences encoding identical amino acid 
5 sequences to those contained in the published for the VH genes 
DP-50, DP-54, and DP-45 (Tomlinson et al. (1992) J. Mol. Biol. 
227 : 776) . These four VH genes were used to build the pH3V4D 
plasmid. 

The 184.3 gene was found to be contained within a 3 

10 kb BamHI fragment. This fragment was subcloned into the 

plasmid vector pGPlf such that the Xhol site of the polylinker 
is 5 1 of the gene, and the Sail site is 3'. The resulting 
plasmid is called pl84.3.36f. The 184.14 gene was found to be 
contained within a 4.8 kb Hindlll fragment. This fragment was 

15 subcloned into the plasmid vector pUC19 in an orientation such 
that the gene could be further isolated as a 3.5 kb fragment by 
Xhol/ Sail digestion at a genomic Xhol site 0.7 kb upstream of 
the gene and a polylinker derived Sail site 3' of the gene. 
The resulting plasmid is called pl84.14.1. The 184.17 gene was 

2 0 found to be contained within a 5.7 kb Hindlll fragment. This 
fragment was subcloned into the plasmid vector pSP72 (Promega, 
Madison, WI) in an orientation such that the polylinker derived 
Xhol and Sail sites are, respectively, 5' and 3' of the gene. 
The insert of this plasmid includes an Xhol site at the 3 1 end 

25 of the gene which was eliminated by partial digestion with 
Xhol, Klenow fragment filling-in, and religation. The 
resulting plasmid is called pl84.17SK. The 49.8 gene was found 
to be contained within 6.3 kb Xbal fragment. This fragment was 
subcloned into the plasmid vector pNN03 , such that the 

30 polylinker derived Xhol and Clal sites are, respectively, 5' 
and 3 1 of the gene, to create the plasmid pVH49.8 (Taylor et 
al. (1994) International Immunol . 6: 579) . The Xhol/Clal 
insert of pVH49.8 was then subcloned into pGPlf to create the 
plasmid p49.8f, which includes unique Xhol and Sail sites 

35 respectively at the 5 f and 3 1 end of the 49.8 gene. 

The 3.5 kb Xhol/Sall fragment of pl84.14.1 was cloned 
into the Xhol site of pl84.3.3 6f to generate the plasmid 
pRMVHl, which includes both the 184.14 and the 184.3 genes in 
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the same orientation. This plasmid was digested with Xhol and 
the 5.7 kb Xhol/Sall fragment of p!84.17SK was inserted to 
create the plasmid pRMVH2, which contains, from 5 1 to 3 f , the 
three VH genes 184.17, 184.14, and 184.3, all in the same 
5 orientation. The plasmid pRMVH2 was then cut with Xhol, and 
the 6.3 kb Xhol/Sall insert of p49.8f inserted to create the 
plasmid pH3VH4, which contains, from 5' to 3', the four VH 
genes 49.8, 184.17, 184.14, and 184.3, all in the same 
orientation. 

10 The 10.6 kb XhoI/EcoRV insert of the human D region 

clone pDHl (described supra; e.g., in Example 12) was cloned 
into XhoI/EcoRV digested pGPe plasmid vector to create the new 
plasmid pDHle. This plasmid was then digested with EcoRV and 
ligated with a synthetic linker fragment containing a Sail site 

15 (5 f - ccg gtc gac ccg -3 f ) . The resulting plasmid, pDHles, 

includes most of the human Dl cluster within an insert that can 
be excised with Xhol and Sail, such that the Xhol site is on 
the 5 1 end, and the Sail site is on the 3 1 end. This insert 
was isolated and cloned into the Sail site of pH3VH4 to create 

20 the plasmid pH3VH4D, which includes four germline configuration 
human VH gene segments and 8 germline configuration human D 
segments, all in the same 5 f to 3' orientation. The insert of 
this clone can be isolated, substantially free of vector 
sequences, by digestion with Notl. 

25 

pCORlxa 

The plasmid pCORl (described supra) which contains a 
32 kb Xhol insert that includes 9 human D segments, 6 human J 
segments, the JfjL intronic heavy chain enhancer, the ju. switch 

3 0 region, and the Cfj, coding exons — was partially digested with 

Xhol, Klenow treated, and a synthetic Sail linker ligated in to 
produce the new plasmid pCORlxa, which has a unique Xhol site 
at the 5 1 end and a unique Sail site at the 3* end. Both pCORl 
and pCORlxa contain a 0.6 kb rat heavy chain 3' enhancer 

35 fragment at the 3' end, which is included in the insert if the 
plasmid is digested with Notl instead of Xhol or Xhol/Sall. 



pPl-570 
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A phage PI library (Genome Systems Inc., St. Louis, 
Missouri) was screened by PCR using the oligonucleotide primer 
pair: 

5'- tea caa gec cag caa cac caa g -3 1 
5 5 1 - aaa age cag aag acc etc tec ctg -3 1 

This primer pair was designed to generate a 216 bp PCR product 
with a human 7 gene template. One of the PI clones identified 
was found to contain both the human 73 and 7I genes within an 
10 80 kb insert. The insert of this clone, which is depicted in 
Fig. 80 , can be isolated, substantially free of vector 
sequences, by digestion with NotI and Sail. 

pll-14 

15 Restriction mapping of the human 73/7I clone Pl-570 

revealed a 14 kb BamHI fragment near the 5 1 end of the insert. 
This 14 kb fragment was subcloned into the plasmid vector pGPlf 
such that the polylinker derived Sail site is adjacent to the 
5 f end of the insert. The resulting plasmid is called pB14 . 

2 0 Separately, an 11 kb Ndel/Spel genomic DNA fragment covering 
the 3 1 end of the human /x gene and the 5 1 end of the human 6 
gene, derived from the plasmid clone pJINA (Choi et al. (1993) 
Nature Genetics 4.: 117) , was subcloned into the Sail site of 
pBluescript ( Stratagene , LaJolla , CA) using synthetic 

25 oligonucleotide adapters. The resulting Sail insert was then 
isolated and cloned into the Sail site of pB14 such that the 
relative 5' to 3 1 orientation of the /x fragment from pJINA is 
the same as that of the 7 fragment from Pl-570. The resulting 
clone is called pll-14. The insert of this clone can be 

3 0 isolated , substantially free of vector sequences , by digestion 
with NotI. 

pHP-3a 

The mouse heavy chain 3' enhancer (Dariavach et al. 
35 (1991) Eur. J. Immunol. 21 : 1499; Lieberson et al. (1991) 
Nucleic Acids Res. 19: 933) was cloned from a balb/c mouse 
genomic DNA phage X library. To obtain a probe, total balb/c 



mouse thymus DNA was used as a template for PCR amplification 
using the following two oligonucleotides: 

cck7 6: 5 f - caa tag ggg tea tgg acc c -3 • 
cck77: 5 1 - tea ttc tgt gca gag ttg gc -3 1 

The resulting 220 bp amplification product was cloned 
using the TA Cloning™ Kit (Invitrogen, San Diego, CA) and the 
insert used to screen the mouse phage library. A positively 
hybridizing 5.8 kb Hindlll fragment from one of the resultant 
phage clones was subcloned into pGPlf. The orientation of the 
insert of this subclone, pHC3'ENfa, is such that the 
polylinker Xhol site is adjacent to the 5 1 end of the insert 
and the Sail site adjacent to the 3' end. Nucleotide sequence 
analysis of a portion of this Hindu fragment confirmed that it 
contained the 3' heavy chain enhancer. The insert of pHC3 f ENfa 
includes an Xhol site approximately 1.9 kb upstream of the 
EcoRl site at the core of the enhancer sequence. This Xhol 
site was eliminated by partial digestion, Klenow fill-in, and 
religation, to create the clone pH3 l Efx, which includes unique 
Xhol and Sail sites, respectively, at the 5 1 and 3' ends of the 
insert. 

The 3 1 end of the human y3/yl clone Pl-570 was 
subcloned as follows: Pl-570 DNA was digested with NotI , 
klenow treated, then digested with Xhol; and the 13 kb end 
fragment isolated and ligated to plasmid vector pGP2b which had 
been digested with BamHI, klenow treated, and then digested 
with Xhol. The resulting plasmid, pPX-3 , has lost the 
polylinker NotI site adjacent to the polylinker Xhol site at 
the 5' end of the insert; however, the Xhol site remains 
intact, and the insert can be isolated by digestion with NotI 
and Xhol, or Sail and Xhol. The 3 1 enhancer containing 
Xhol/Sall insert of pH3 f Efx was isolated and ligated into the 
3' Sail site of pPX-3 to create the plasmid pHP-3a. The 
enhancer containing fragment within the pHP-3a insert is 
ligated in the opposite orientation as the 3* end of the Pl-570 
clone. Therefore, pHP-3a contains an internal Sail site, and 
the insert is isolated by digestion with Xhol and NotI. 
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Because this is an enhancer element, 5' to 3 ' orientation is 
generally not critical for function. 



HCo7. 

5 To prepare the HCo7 DNA mixture for pronuclear 

microinjection, DNA from each of the five plasmids described 
above was digested with restriction enzymes and separated on an 
agarose gel. Clone pH3V4D was cut with NotI; pCORlxa was cut 
with NotI; pll-14 was cut with NotI; pPl-570 was cut with NotI 

10 and Sail; and pHP-3a was cut with NotI and Xhol. The DNA 
inserts were electroeluted and further purified on an 
equilibrium CsCl gradient without EtBr. The inserts were 
dialyzed into injection buffer and mixed as follows: 50 
microliters of pH3V4D insert @ 20.4 ng/microliter ; 50 

15 microliters of pCORlxa insert @ 20.8 ng/microliter; 50 
microliters of pll-14 insert @ 15.6 ng/microliter; 300 
microliters of pPl-570 insert @ 8.8 ng/microliter; 60 
microliters of pHP-3a insert @ 10.8 ng/microliter; and 1.49 ml 
injection buffer. 

20 

HCo7 transgenic animals 

The HCo7 DNA mixture was microinjected into the 
pronuclei of one-half day old embryos, and the embryos 
transferred into the oviducts of pseudopregnant females, as 

25 described by Hogan et al. (Manipulating the mouse embryo, Cold 
Spring Harbor laboratories, Cold Spring Harbor NY) . 

Tail tip DNA was isolated from 2 02 animals that 
developed from microinjected embryos. Southern blot analysis 
of this DNA, using a probe comprising human /i and DH sequences, 

3 0 revealed 22 founder animals that had incorporated at least a 
portion of the HCo7 transgene. Fig. 81 shows an analysis of 
the expression of human \i and human yl in the serum of 6 GO 
animals that developed from embryos microinjected with HCo7 
DNA. Serum levels of human immunoglobulin proteins were 

35 measured by ELISA as described in Lonberg et al. (1994) Nature 
368 : 856. Four of these six mice showed evidence of 
incorporation of the transgene by Southern blot analysis, and 
three of these mice expressed both human \i and human 71 
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proteins in their serum. The single transgenic mouse that did 
not express human immunoglobulin proteins was determined by 
Southern blot analysis to contain only a low number of copies 
of the transgene, and it is possible that the entire transgene 
5 was not incorporated, or that this mouse was a genetic mosaic. 
Two of the founder HCo7 mice, #11952 and #11959, were bred with 
human k minilocus (KCo4 line 4436) transgenic mice that were 
also homozygous for disruptions of the endogenous heavy, and k 
light chain loci (Lonberg et al. op.cit) , to generate mice that 
10 were homozygous for the two endogenous locus disruptions and 
hemizygous for the two introduced human miniloci, KCo4 and 
HCo7. Five of these so-called 

double-transgenic/double-deletion mice were analyzed for 
expression of human IgM, human IgGl, and human IgG3 . As a 

15 control, three HC2/KCo4 double-transgenic/double-deletion mice 
were included in the analysis. This experiment is presented in 
Fig. 82. The ELISA data in this figure was collected as in 
Lonberg et al. (op.cit) , except that for detection of human 
IgG3 , the coating antibody was a specific mAb directed against 

20 human IgG3 (cat. # 08041, Pharmingen, La Jolla, CA) ; the other 
details of the IgG3 assay were identical to those published for 
IgGl. While the HC2/KCo4 mice express only human IgM and human 
IgGl, the HCo7/KCo4 mice also express human IgG3 in addition to 
these two isotypes. Expression of human y3 and yl in the HCo7 

25 mice has also been detected by PCR amplification of cDNA 

synthesized from RNA isolated from the spleen of a transgenic 
mouse. Fig. 83 depicts PCR amplification products synthesized 
using spleen cDNA from three different lines of transgenic 
mice: line 2550 is an HC2 transgenic line, while lines 11959 

30 and 11952 are HCo7 transgenic lines. Single stranded cDNA was 
synthesized from spleen RNA as described by Taylor et al. 
(1992) Nucleic Acid Res. 20 : 6287. The cDNA was then PCR 
amplified using the following two oligonucleotides: 

35 o-382: 5'- gtc cag aat teg gt(c,g,t) cag ctg gtg (c,g)ag tct 
gg -3' 



o-383 : 



5'- ggt ttc teg agg aag agg aag act gac ggt cc -3' 
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This primer pair directs the synthesis of PCR products that 
spans the hinge region of human 7 transcripts. Because of 
differences in the structures of the human 71 and 73 hinge 
regions, PCR amplification distinguishes between these two 
5 transcripts, A human 71 template will direct the synthesis of 
a 752 bp PCR product, while human 73 directs the synthesis of a 
893 bp product. While only human 7I template is detectable in 
the HC2 line 2550 and HCo7 line 11959 spleens, both 71 and 73 
transcripts are detectable in the HCo7 line 11952 spleen. 

10 Because of the non-quantitative nature of this assay, and 
because of differences in 73 expression between individual 
animals (shown by ELISA in Fig. 82) , the inability to observe 
73 in the-HCo7 line 11959 spleen in Fig. 83 does not indicate 
that 73 is not expressed in this line. Isolated spleen cells 

15 from the HCo7/KCo4 mice can also be induced to express both 

IgGl and IgG3 in vitro by stimulation with LPS and IL4 . This 
experiment is shown in Fig. 84. Spleen cells from a 7 week old 
male HCo7/KCo4 double-transgenic/double-deletion mouse (#12496; 
line 11959/4436) tested for immunoglobulin secretion in 

2 0 response to the thymus- independent B cell mitogen, LPS, alone 

and in conjunction with various cytokines. Splenocytes were 
enriched for B cells by cytotoxic elimination of T cells. 
B-enriched cells were plated in 24 well plates at 2 x 10 6 cells 
per well in 2 ml of 10% FCS in RPMI-1640. LPS was added to all 
25 wells at 10 micrograms/ml. IL-2 was added at 50 units/ml, IL-4 
was added at 15 ng/ml, 1L-6 was added at 15 ng/ml, 7IFN was 
added at 100 units /ml. Cultures were incubated at 3 7 °C, 5% CQ 2 
for 10 days, then supernatants were analyzed for human IgGl and 
IgG3 by ELISA. All reagents for ELISA were polyclonal 

3 0 anti-serum from Jackson Immunologicals (West Grove, PA) , except 

the capture anti-human IgM, which was a monoclonal antibody 
from The Binding Site (Birmingham, UK) . 

EXAMPLE 38 

35 This example demonstrates the successful introduction 

into the mouse genome of functional human light chain V 
segments by co-injection of a human k light chain minilocus and 
a YAC clone comprising multiple human V K segments. The example 
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shows that the V K segment genes contained on the YAC contribute 
to the expressed repertoire of human k chains in the resultant 
mouse. The example demonstrates a method for repertoire 
expansion of transgene-encoded human immunoglobulin proteins, 
5 and specifically shows how a human k chain variable region 
repertoire can be expanded by co-introduction of unlinked 
polynucelotides comprising human immunoglobulin variable region 
segments . 

10 Introduction of functional human light chain V segments by 

co-iniection of Vk containing yeast artificial chromosome clone 
DNA and k light chain minilocus clone DNA 

I. Analysis of a yeast strain containing cloned human Vk 
gene segments . 

15 Total genomic DNA was isolated from a yeast strain 

containing a 450 kb yeast artificial chromosome (YAC) 
comprising a portion of the human V K locus (ICRF YAC library 
designation 4xl7El) . To determine the identity of some of the 
V K gene segments included in this YAC clone, the genomic DNA 

20 was used as a substrate for a series of V K family specific PCR 
amplification reactions. Four different 5 f primers were each 
paired with a single consensus 3' primer in four sets of 
amplifications. The 5 1 primers were: O-270 (5'-gac ate cag ctg 
acc cag tct cc-3 ' ) , o-271 (5'-gat att cag ctg act cag tct 

25 cc-3'), o-272 (5'-gaa att cag ctg acg cag tct cc-3') , and o-273 
(5'-gaa acg cag ctg acg cag tct cc-3'). These primers are used 
by Marks et al. (Eur. J. Immunol. 1991. 21 , 985) as V K family 
specific primers. The 3' primer, o-274 (5'-gca age ttc tgt ccc 
aga ccc act gec act gaa cc-3 9 ) , is based on a consensus 

3 0 seguence for FR3 . Each of the four sets of primers directed 
the amplification of the expected 0.2 kb fragment from yeast 
genomic DNA containing the YAC clone 4xl7El. The 4 different 
sets of amplification products were then gel purified and 
cloned into the PvuII/Hindlll site of the plasmid vector pSP72 

35 (Promega) . Nucleotide seguence analysis of 11 resulting clones 
identified seven distinct V genes. These results are presented 
below in Table 14 . 
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Table 14 . Identification of human V K segments on the YAC 
4X17E1. 



PCR primers clone # identified V/c family 

gene 



O-270/O-274 


1 


L22* 


I 


~~ ?! ~* 


4 


L22* 


T 

I 


*~ ti ~ 


/ 




T 


0-271/0-274 


11 


A10* 


VI 


it — 


15 


A10* 


VI 


0-272/0-274 


20 


A4* or A20 


I 


— it "™* 


21 


All* 


III 


ii ~ ~ 


22 


All* 


III 


— ir ~ 


23 


All* 


III 


™ ii ™ 


25 


04* or 014 


I 


0-273/0-274 


36 


L16* or L2 


III 



* Gene segments mapped within the distal V K cluster (Cox et al. 
Eur. J* Immunol. 1994. 24, 827; Pargent et al. Eur. J. Immunol. 
1991. 21, 1829; Schable and Zachau Biol. Chem. Hoppe-Seyler 
20 1993. 374, 1001) 

All of the sequences amplified from the YAC clone are 
either unambiguously assigned to V x genes that are mapped to 
the distal cluster, or they are compatible with distal gene 

25 sequences. As none of the sequences could be unambiguously 
assigned to proximal V genes, it appears that the YAC 4xl7El 
includes sequences from the distal Vic region. Furthermore, one 
of the identified sequences, clone #7 (Vk02) , maps near the J 
proximal end of the distal cluster, while another sequence, 

30 clones # 1 and 4 (VkL22) , maps over 300 kb upstream, near the J 
distal end of the distal cluster. Thus, if the 450 kb YAC 
clone 4xl7El represents a non-deleted copy of the corresponding 
human genome fragment, it comprises at least 3 2 different V K 
segments. However, some of these are non-functional 

3 5 pseudogenes . 
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2 . Generation of transgenic mice containing YAC 
derived V K gene segments. 

To obtain purified YAC DNA for microinjection into 
embryo pronuclei, total genomic DNA was size fractionated on 
5 agarose gels. The yeast cells containing YAC 4xl7El were 

imbedded in agarose prior to lysis, and YAC DNA was separated 
from yeast chromosomal DNA by standard pulse field gel 
electrophoresis (per manufacturers specifications: CHEF DR-II 
electrophoresis cell, BIO-RAD Laboratories, Richmond CA) . Six 
10 individual pulse field gels were stained with ethidium bromide 
and the YAC clone containing gel material was cut away from the 
rest of the gel. The YAC containing gel slices were then 
imbedded in a new (low melting temperature) agarose gel cast in 
a triangular gel tray. The resulting triangular gel was 
15 extended at the apex with a narrow gel containing two 

moles/ liter sodium acetate in addition to the standard gel 
buffer (Fig. 85) . 

The gel was then placed in an electrophoresis chamber 
immersed in standard gel buffer. The "Y" -shaped gel former 

2 0 rises above the surface of the buffer so that current can only 

flow to the narrow high salt gel slice. A Plexiglas block was 
placed over the high salt gel slice to prevent diffusion of the 
NaOAc into the gel buffer. The YAC DNA was then 
electrophoresed out of the original gel slices and into the 
25 narrow high salt block. At the point of transition from the 
low salt gel to the high salt gel, there is a resistance drop 
that effectively halts the migration of the YAC DNA through the 
gel. This leads to a concentration of the YAC DNA at the apex 
of the triangular gel. Following electrophoresis and staining, 

3 0 the concentrated YAC DNA was cut away from the rest of the DNA 

and the agarose digested with GELase (EPICENTRE Technologies) . 
Cesium chloride was then added to the YAC DNA containing liquid 
to obtain a density of 1.68 g/ml. This solution was 
centrifuged at 37,000 rpm for 36 hrs to separate the DNA from 
35 contaminating material. 0 . 5 ml fractions of the resulting 
density gradient were isolated and the peak DNA containing 
fraction dialyzed against 5 mM tris (pH 7.4) /5 mM NaCl/0.1 M 
EDTA. Following dialysis, the concentration of the resulting 
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0.65 ml solution of YAC DNA was found to be 2 micrograms /ml • 
This DNA was mixed with purified DNA insert from plasmids pKClB 
and pKV4 (Lonberg et al. 1994. Nature 368, 856) at a ratio of 
20: 1: 1 (micrograms YAC4xl7El: KC1B: KV4 ) . The resulting 2 
5 microgram/ml solution was injected into the pronuclei of 

half-day mouse embryos, and 95 surviving microinjected embryos 
transferred into the oviducts of pseudo-pregnant females. 
Thirty nine mice were born that developed from the 
microinjected embryos. Two of these mice, #9269 and #9272, 

10 were used to establish transgenic lines. The lines are 
designated KCo5-9269 and KCo5-9272. 

A Southern blot analysis of genomic DNA from mice of 
lines KCo5-9269 and KCo5-9272 was carried out to determine if 
YAC 4X17E1 derived V K segments had been incorporated in their 

15 genomes. A V K gene segment, VkAlO (accession #: X12683; 
Straubinger et al. 1988. Biol. Chem. Hoppe-Seyler 369, 
601-607) , from the middle of the distal V K cluster was chosen 
as a probe for the Southern blot analysis. To obtain the 
cloned probe, the VkAlO gene was first amplified by PCR. The 

20 two oligo nucleotides, o-337 (5»- egg tta aca tag ccc tgg gac 
gag ac -3 1 ) and o-3 38 (5'- ggg tta act cat tgc etc caa age ac 
-3 • ) , were used as primers to amplify a 1 kb fragment from YAC 
4X17E1. The amplification product was gel purified, digested 
with Hindi, and cloned into pUC18 to obtain the plasmid 

25 pl7E!A10. The insert of this plasmid was then used to probe a 
southern blot of KCo5-9269 and KCo5-9272 DNA. The blot showed 
hybridization of the probe to the expected restriction 
fragments in the KCo5-9272 mouse DNA only. This indicates that 
the VkAlO gene is incorporated into the genome of KCo5-9272 

30 mice and not KCo5-9269 mice. Line KCo5-9272 mice were then 

bred with HC2-2550/JHD/JKD mice to obtain mice homozygous for 
disruptions of the endogenous heavy and k light chain loci, and 
hemi- or homozygous for the HC2 and KCo5 transgenes. Animals 
that are homozygous for disruptions of the endogenous heavy and 

35 k light chain loci, and hemi- or homozygous for human heavy and 
k light chain transgenes are designated double 
transgenic/double deletion mice. 
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A cDNA cloning experiment was carried out to 
determine if any of the YAC-derived V K genes are expressed in 
line KCo5-9272 mice. The double transgenic/double deletion 
mouse #12648 (HC2-2550/KCO5-9272/JHD/ JKD) was sacrificed and 
5 total RNA isolated from the spleen. Single stranded cDNA was 
synthesized from the RNA and used as a template in four 
separate PGR reactions using oligonucleotides O-270, o-271, 
o-272, and o-27 3 as 5 1 primers, and the Ck specific 
oligonucleotide, o-18 6 (5 1 - tag aag gaa ttc age agg cac aca aca 
10 gag gca gtt cca -3 r ) , as a 3 1 primer. The amplification 
products were cloned into the pCRII TA cloning vector 
(Invitrogen) . The nucleotide sequence of 19 inserts was 
determined. The results of the sequence analysis are 
summarized in Table 15 below. 



239 

Table 15. Identification of human Vk genes expressed in mouse 
line KCoS-9272. 



PCR primers clone # identified Vk family 

gene 



5 


0-270/0-186 


1 


L15* 


I 




it ~ 


3 


L18** 


I 




— ?i ~ 


7 


L24** 


I 




— „ -** 


9 


L15* 


I 




— i? ~ ~ 


10 


L15* 


I 


10 


0-271/0-186 


15 


A10** 


VI 




~~ ii *~ 


17 


A10** 


VI 




~ »i — 


18 


A10** 


VI 




— tt ~* 


19 


A10** 


VI 






21 


A10** 


VI 


15 


o-272/o-186 


101 


A27* 


III 




~ ii ~~ 


102 


L15* 


I 




_ !1 ~ " 


103 


A27* 


III 




"~ If ~~ 


104 


A27* 


III 




0-273/0-186 


35 


A27* 


III 


20 


*** ii — 


38 


A27* 


III 






44 


A27* 


III 




~ ii ~ 


45 


A27* 


III 




*~ it ~ 


48 


A27* 


III 



25 * Vk genes encoded by transgene plasmid sequences. 

** Vk genes encoded uniquely by YAC derived transgene sequences. 



These results show that at least 3 of the YAC 
derived V K gene segments, A10, L18, and L24, contribute 
30 to the expressed human repertoire of the line KCo5-9272 

mice. 

To determine the effect of this increased 
repertoire on the size of the various B220 + cell 
populations in the bone marrow and spleen, a flow 
35 cytometric analysis was carried out on line KCo5-9272 



mice. Part of this analysis is shown in Figs. 86 and 87. 
Two double transgenic/double deletion mice, one 
containing the KCo5 transgene, and one containing the 
KCo4 transgene, are compared in this experiment. These 
two transgenes share the same joining and constant region 
sequences, as well as the same intronic and 3' enhancer 
sequences. They also share four different cloned V gene 
segments; however, the KCo5 transgene includes the 
additional V segments derived from YAC 4xl7El that are 
not included in the KCo4 transgene. Cells were isolated 
from mouse #13534 (HC2-2550/KCo5— 9272/ JHD/ JKD) and mouse 
#13449 (HC2-2550/KCo4-443 6/ JHD/ JKD) . Bone marrow cells 
were stained with anti-mouse B22 0 (Caltag, South San 
Francisco, CA) , anti-mouse CD43 (Pharmingen, La Jolla, 
CA) , and anti-human IgM (Jackson Immunologic, West Grove, 
PA) . Spleen cells were stained with anti-mouse B220 and 
ant i -human IgM. 

Fig. 86 shows a comparison of the B cell, and B 
cell progenitor populations in the bone marrow of KCo5 
and KCo4 mice. The fraction of B cells in the bone 
marrow (B220 + , IgM + ) is approximately three times higher 
in the KCo5 mice (6%) than it is in the KCo4 mice (2%) . 
The pre-B cell population (B220 + , CD43", IgM) is also 
higher in the KCo5 mice (9%, compared to 5% for KCo4) . 
Furthermore, the pro-B compartment (B22CT, CD43 + ) is 
elevated in these mice (11% for KCo5 and 5% for KCo4) . 
Although each of these three compartments is larger in 
the KCo5 mice than it is in the KCo4 mice, the levels are 
still approximately half that found in wild type mice. 
The increase in the number of bone marrow B cells is 
presumably a direct consequence of the increased 
repertoire size. The larger primary repertoire of these 
mice may provide for membrane Ig with some minimal 
threshold affinity for endogenous antigens. Receptor 
ligation could then allow for proliferation of those B 
cells expressing the reactive Ig. However, because the 
pre-B and pro-B cells do not express light chain genes, 
the explanation for the increased sizes of these two 



compartments in the KCo5 mice is not immediately 
apparent. The B cell progenitor compartments may be 
larger in KCo5 mice because the increased number of B 
cells creates a bone marrow environment that is more 
conducive to the expansion of these populations. This 
effect could be mediated directly by secreted factors or 
by cell-cell contact between B cells and progenitor 
cells, or it could be mediated indirectly, by titration 
of factors or cells that would otherwise inhibit the 
survival or proliferation of the progenitor cells. 

Fig. 87 shows ; a comparison of the splenic B 
cell (B220 + , IgM + ) populations in KCo5 and KCo4 mice. The 
major difference between these two mice is the relative 
sizes of B220 duU B cell populations (6% in the KCo5 mice 
and 13% in the KCo4 mice) . The B220 duU cells are larger 
than the 622 0^ B cells, and a higher fraction of them 
express the 1 light chain. These are characteristics of 
the so-called Bl population that normally dominates the 
peritoneal B cell population in wild type mice. The 
spleens of the KCo4 mice comprise an anomolously high 
fraction of B220 du11 cells, while the KCo5 mice have a more 
normal distribution these cells. However, both strains 
contain approximately one-half to one-third the normal 
number of B cells in the spleen. 

EXAMPLE 3 9 

This example demonstrates the successful use of 
KCo5 transgenic mice of Example 3 8 to isolate hybridoma 
clones that secrete high affinity, antigen specific, 
human IgG monoclonal antibodies. 

Immunization. A double deletion/double transgenic mouse 
(KCO5-9272/HC2-2550/JHD/JKD, #12657) was immunized 
intraperitoneally every other week for eight weeks with 4 
to 10 x 10 6 irradiated T4D3 cells, a murine T cell line 
expressing human CD4 (Dr. Jane Parnes, Stanford 
University) followed by one injection intraperitoneally 



two weeks later of 20 mg soluble recombinant human CD 4 
(sCD4; Intracell) in incomplete Fruend's adjuvant 
(Sigma) ♦ The mouse was boosted once 3 days prior to 
fusion with 2 0 mg sCD4 intravenously. 

Hybridoma fusion. Single cell suspensions of splenic 
lymphocytes from the immunized mouse were fused to 
one-sixth the number of P3X63-Ag8 . 653 nonsecreting mouse 
myeloma cells (ATCC CRL 1580) with 50% PEG (Sigma) . 
Cells were plated at approximately 2 X 10 5 in flat bottom 
microtiter plates, followed by a two week incubation in 
selective medium containing 2 0% Fetal Clone Serum 
(HyClone) , 18% "653" conditioned medium, 5% Origen 
(IGEN) , 4 mM L-glutamine, 1 mM sodium pyruvate, 5 mM 
HEPES, 0.055 mM 2-mercaptoethanol , 50 units/ml mM 
penicillin, 50 mg/ml streptomycin, 50 mg/ml mM gentamycin 
and IX HAT (Sigma; the HAT was added 24 hrs after the 
fusion) . After two weeks, cells were cultured in medium 
in which the HAT was replaced with HT. Wells were 
screened by ELISA and flow cytometry once extensive 
hybridoma growth or spent medium was observed. 

Hvbridoma screening bv ELISA. To detect anti-CD4 mAbs, 
microtiter plates (Falcon) were coated overnight at 4°C 
with 50 ml of 2.5 mg/ml of sCD4 in PBS, blocked at RT for 
1 hr with 10 0 ml of 5% chicken serum in PBS, and then 
sequentially incubated at RT for 1 hr each with 1:4 
dilutions of supernatant from hybridomas, 1:1000 dilution 
of F(ab') 2 fragments of horseradish peroxidase 
(HRPO) -conjugated goat anti-human IgG (Jackson) or 1:250 
dilution of HRPO-conjugated goat anti-human Igk 
antibodies (Sigma) plus 1% normal mouse serum, and 
finally with 0.22 mg/ml ABTS in 0.1 M citrate phosphate 
buffer, pH 4 with 0.0024% H 2 0 2 . Plates were washed 3-6 
times with wash buffer (0.5% Tween-20 in PBS) between all 
incubations, except the first. Diluent (wash buffer with 
5% chicken serum) was used to dilute the supernatants and 
the HRPO conjugates. Absorbance was measured using dual 



wavelengths (OD at the reference wavelength of 490 nin was 
subtracted from the OD at 415 nm) . 

To detect mouse X-containing mAbs, the above 
ELISA protocol was used, with the following exceptions. 
Wells of microtiter plates were coated with 100 ml of 1) 
1.25 mg/ml goat anti-mouse X (Pierce), 2) 1.25 mg/ml goat 
anti-human Fey (Jackson), or 3) 2.5 mg/ml sCD4 (ABT) . 
For the detection step, 100 ml of 1:5000 goat anti-mouse 
1 (SB A) conjugated to biotin was used followed by 100 ml 
of 1:1000 streptavidin conjugated to HRPO (Jackson). 
Murine and human mAb standards were used at the indicated 
concentrations. To look for cross-reactivity to 
unrelated antigens, wells were coated with CEA (Crystal 
Chem) , KLH (CalBiochem) , HSA (Sigma) , BSA (Sigma) or OVA 
(Sigma; all at 2 mg/ml, except CEA which was at 2.5). 
Appropriate antibodies were titered and used as positive 
controls (human IgM anti-CEA (GenPharm) , rabbit anti-KLH 
(Sigma) , sheep anti-HSA (The Binding Site) , sheep 
anti-BSA (The Binding Site) , and sheep anti-OVA (The 
Binding Site) ) . Any bound antibody was detected with 
HRPO conjugates of goat anti-human IgM, donkey 
anti-rabbit IgG or donkey anti-sheep IgG (all diluted 
1:1000 and obtained from Jackson). Otherwise, the 
standard ELISA protocol was followed. 

Hybridoma screening by flow cytometric assay. To further 
screen for mAbs reactive with native cell-surface CD4, 5 
x 10 5 SupTl cells (ATCC CRL 1942) were incubated on ice 
with a 1:2 dilution of spent supernatant from the fusion 
plates for 3 0 min, washed twice with cold stain buffer 
(0.1% BSA, 0.02% NaN 3 in PBS), incubated with 1.5 mg/ml of 
an F(ab') 2 fragment of FITC-conjugated goat anti-human Fcg 
(FITC-GaHuIgG; Jackson) for 15 min, washed once and 
analyzed immediately on a FACScan (Becton-Dickinson) . 

CD4 reactive hvbridomas. Using the ELISA and flow 
cytometric techniques described above, 12 hybridoma 
clones were identified that secreted human IgG 



specifically reactive with native human CD4 . Ten of 
these twelve clones were further subcloned. Eight of 
these subclones were identified as human IgGl* secreting 
hybridomas. The other two expressed a mouse X light 
chain. The parent wells for the 8 fully human clones 
were: 1E11, 2E4, 4D1, 6C1, 6G5, 7G2 , 10C5, and 1G1. Flow 
cytometric assays of the binding of 3 of the fully human 
IgGk subclones (4D1.4, 6G5.1, and IOCS . 6) are shown in 
Fig, 88. 

Fig. 88 shows binding of IgG/c anti-nCD4 
monoclonal antibodies to CD4 + SupTl cells. Cells from 
log phase growth cultures were washed and stained with no 
monoclonal antibody, 4E4.2 (as a negative control), 
chimeric Leu3a (as a positive control) , or with one of 
the 10 human IgG anti-nCD4 monoclonal antibodies. Any 
bound monoclonal antibody was detected with 
FITC-conjugated goat anti-human Fey • All ten monoclonal 
antibodies bound to SupTl cells, although data is shown 
here for only three of them. 

Analysis of human antibody secretion by cloned 
hybridomas. To compare the growth and secretion levels 
of mAbs, the subclones were put into replicate cultures 
in HT medium in 2 4 well plates at an initial density of 2 
x 10 5 cells/ml. Each day for 7 days, one of the 
replicate cultures for each subclone was harvested and 
cell numbers, cell viability (by Trypan blue exclusion) 
and the amount of mAb in the supernatant (by a 
quantitative ELISA for total human 7) were determined. 
Table 16 shows data for antibody secretion by 7 of the 
hybridoma subclones . 

Table 16. Secretion Levels For Human IgGk Anti-nCD4 
Monoclonal Antibodies 

Subclone pg/cell pg/cell/d 

1E11. 15 379 0.56 

1G1.9 11 1.5 

4D1.4 1.4 0.91 
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6C1.10 3.3 0.48 

6G5.1 7.8 1.1 

7G2.2 4.4 0.63 

10C5.6 8.0 1.1 

* pg/cell = (maximum amount of mAb) / (maximum number of 
viable cells) pg/cell/d = (pg/cell) /7 days 



Purification of human mAbs. The individual hybridoma 

10 clones were grown in medium without HT and Origen and the 
FCS was gradually decreased to approximately 2-3% in the 
final 1 1 cultures. Supernatants were harvested once the 
viability of the hybridomas fell below approximately 30%. 
To purify the IgGk mAbs, the spent supernatants were 

15 centrifuged to remove cells, concentrated via 

ultrafiltration to approximately 50 to 100 mis, diluted 
1:5 with PBS, pH 7.4 and loaded onto a 5 ml Protein A 
(Pharmacia) column. After washing with 3-5 column 
volumes of PBS, the human IgGk mAbs were eluted with 0.1 

20 HC1, 150 mM NaCl, pH 2 . 8 and immediately neutralized with 
1M Tris base. Column fractions containing material with 
an OD^ > 0.2 were pooled and dialyzed into PBS . The OD^ 
was then determined and an absorbtivity coefficient of 
1.4 was used to calculate the protein concentration of 

25 the human IgG. No mAb was detected in the flow through 

and the % recoveries ranged from 93 to 100%. Three to 
six mgs of each purified mAb were obtained, with >90% 
purity. 

3 0 Analysis of monoclonal antibodies from cloned hybridomas. 
To investigate the specificity of binding of mAbs, human 
PBMC were isolated over Ficoll and stained as follows. 
Human PBMC (10 6 ) in stain buffer were incubated for 3 0 min 
on ice, in separate reactions, with equal volumes of 

3 5 supernatant from each of three of the subcloned 

hybridomas (4D1.4, 6G5.1, and IOCS. 6), or with an isotype 
matched negative control mAb, washed twice, and incubated 
2 0 min on ice with 1 mg/ml of FITC-GaHuIgG along with 
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either 10 ml of mouse anti-human CD 4 mAb (Leu3a; 
Becton-Dickinson) conjugated to phycoerythrin (PE) , 10 ml 
of mouse anti-human CD8 mAb (Leu2a; Becton-Dickinson) 
conjugated to PE, or 5 ml of mouse anti-human CD19 mAb 
(SJ25-C1; Caltag) conjugated to PE. Gated lymphocytes 
were then analyzed on a FACScan flow cytometer (Becton 
Dickinson, San Jose, CA) . All three of the antibodies 
were found to bind specifically to the CD4 fraction of 
the human PBMC. 

To approximate the location of the epitope 
recognized by these three mAbs, 5 x 10 5 SupTl cells were 
pre- incubated for 20 min on ice with buffer, 2.5 mg/ml 
RPA-T4, or 2,5 mg/ml Leu3a in stain buffer, then for 30 
min with one of the 10 human IgG mAbs (in supernatant 
diluted 1:2) and finally with 0.5 mg/ml FITC-conjugated 
goat anti-human Fey to detect any bound human IgG. Cells 
were washed twice with stain buffer prior to and once 
after the last step. The results of this blocking assay 
are shown in Fig. 89. None of the three antibodies share 
an epitope with RPA-T4, while 6G5 . 1 and 10C5 . 6 appear to 
recognize the same (or an adjacent) epitope as that 
recognized by Leu3a. 

Rate and equilibrium constant determinations. 

Human sCD4 (2500 to 4200 RU) was immobilized by 
covalent coupling through amine groups to the sensor chip 
surface according to manufacturer's instructions. 
Antibody dilutions were flowed over the antigen-coupled 
sensor chips until equilibrium was reached, and then 
buffer only was allowed to flow. For each phase of the 
reaction, binding and dissociation, the fraction of bound 
antibody was plotted over time. The derivative of the 
binding curve (dR/dt) was calculated and plotted against 
the response for each concentration. To calculate the 
association rate constant (k assoc ) , the slopes of those 
resulting lines were then plotted against the 
concentration of the monoclonal antibody. The slope of 
the line from this graph corresponded to the k^. The 



dissociation rate constant (k disaoc ) was calculated from the 
log of the drop in response (during the buffer flow 
phase) against the time interval. The Ka was derived by- 
dividing the by the k^. The measured rate and 
affinity constant data for 5 different purified 
monoclonal antibodies derived from the KCo5/HC2 double 
transgenic/double deletion mice, and one purified 
antibody obtained from a commercial source (Becton 
Dickinson, San Jose, CA) , is presented in Table 17. 



Table 17. Rate and affinity constants for monoclonal 
antibodies that bind to human CD4. 
Hybridoma Antibody Source k^ (M^S 1 ) k diss ( S' 1 ) Ka (M -1 ) 



1E11. 15 


human 
IgGlk 


HC2/KCo5 
transgenic 


2.7 x 10 3 


4.6 x 10' 3 


5.8 x 10 9 


1G1.9 


human 
IgGlk 


HC2 /KCo5 
transgenic 


9.1 x 10* 


2.2 x lO" 5 


4.2 x 10* 


4D1.4 


human 
IgGlk 


HC2/KCo5 
transgenic 


9.8 x 10* 


4.2 x 10 3 


2.3 x 10 9 


6G5.1 


human 
IgGlk 


HC2/KCo5 
transgenic 


1.1 x 10 3 


1.0 x 10' 5 


1.1 x 10 10 


10C5.6 


human 
IgGlk 


HC2/KCo5 
transgenic 


7.4 x 10 4 


1.6 x 10' 5 


4.5 x 10 9 


Leu3a 


mouse 
IgGlk 


Becton 
Dickinson 


1.5 x 10 5 


4.2 x 10* 


3.7 x 10 10 



Mixed Lymphocyte Reaction (MLR) . To compare the in vitro 
efficacy of the human monoclonal antibody 10C5.6, derived 
from the KCo5 transgenic mouse, to that of the mouse 
antibody Leu3a, an MLR assay was performed. Human PBMC 
from 2 unrelated donors were isolated over Ficoll and 
CD4+ PBL from each donor were purified using a CD4 column 
(Human CD4 Cellect, Biotex Laboratories, Inc., Canada) 
according to manufacturer's directions. Inactivated 
stimulator cells were obtained by treating PBMC from both 
donors with 100 mg/ml mitomycin C (Aldrich) in culture 
medium (RPMI 164 0 with 10% heat-inactivated human AB 
serum (from NABI) , Hepes, sodium pyruvate, glutamine, 
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pen/strep and b-mer captoethano 1 (all used at 
manufacturer's recommended concentrations)) for 3 0 min at 
37°C followed by 3 washes with culture medium. Varying 
concentrations of mAbs diluted in culture medium or 
culture medium only were sterile filtered and added at 
100 ml per well in triplicate in a 96 well round bottom 
plate. Fifty ml of 10 5 CD4+ PBL from one donor in 
culture medium and 10 5 mitomycin C-treated PBMC from the 
other donor in 50 ml of culture medium were then added to 
each well. Control plates with CD4 + PBL responders alone 
plus mAbs were set up to control for any toxic or 
mitogenic effects of the mAbs. A stimulator only control 
and a media background control were also included. After 
seven days in a 37°C, 5% C0 2 humidified incubator, 100 ml 
of supernatant from each well was removed and 2 0 ml of 
colorimetric reagent (Cell Titer 96AQ kit, Promega 
Corporation, Madison, WI) was added. Color was allowed 
to develop for 4 to 6 hrs and plates were read at 490 nm. 
The results of this experiment, depicted in Fig. 90, show 
that the human IgGlk antibody IOCS. 6 is at least as 
effective as Leu3a at blocking the function of human PBMC 
CD4 cells in this assay. 

Example 40. 

Binding Characteristics of Human IaGkappa Anti-CD4 
monoclonal Antibodies. 
This example provides the binding 
characteristics of human IgG/c monoclonal antibodies 
derived from hybridoma clones obtained from 
HC2/KCo5/JHD/JCKD transgenic mice immunized with human 
CD4. The monoclonal antibodies are shown to have high 
avidity and affinity for recombinant and natural human 
CD4. 

Cells from 10 individual hybridoma cell lines 
(1E11, 1G2, 6G5, 10C5, 1G1 , 6C1, 2E4, 7G2 , 1F8 and 4D1) 
that secrete human IgG kappa monoclonal antibodies (mAB) 
reactive with human CD 4 , were derived from 
JHD/ JCKD/HC2 /KCo5 transgenic mice. The cell lines were 



249 

grown in culture, and antibody proteins were isolated 
from the supernatant (Fishwild, et al. 1996, Nature 
Biotechnology 14, 845-851, which is incorporated herein 
by reference) . Antibody purified by Protein A affinity 
chromatography was used to measure binding constants. 
The results are displayed in Tables 18 and 19. 

The rate and equilibrium constants presented in 
Table 18 were determined with a BIAcore (Pharmacia 
Biosensor) using goat anti-human IgG (Fc-specif ic) 
coupled to the sensor chip and flowing a saturating 
concentration of mAb over followed by various 
concentrations of antigen (rCD4) . These constants were 
derived from three experiments using purified mAbs. 

Table 18. Affinity and Rate Constants. 

Rate Constants (mean ± SD) 



Human mAb k assoc (M" 1 s~ 1 ) *dissoc ( s X > K a < M X ) 



1E11. 15 


1.7 


(± 


0. 15) 


X 


10 5 


3.5 


(± 


0. 


09) 


X 


10" 3 


5. 


0 


X 


10 7 


6C1. 10 


1.8 


(± 


0.44) 


X 


10 5 


3 . 3 


(± 


0. 


04) 


X 


10" 3 


5. 


4 


X 


10 7 


1G1.9 


1.2 


(± 


0.18) 


X 


10 5 


9.4 


(± 


0. 


22) 


X 


IO" 4 


1. 


3 


X 


10 8 


6G5. 1 


9.3 


(± 


1.1) 


X 


10 4 


6.9 


(± 


0. 


36) 


X 


10" 4 


1. 


4 


X 


10 8 


IOCS. 6 


9.4 


(± 


0. 98) 


X 


10 4 


7.1 


(± 


0. 


36) 


X 


io- 4 


1. 


3 


X 


10 8 


2E4.2 


1.8 


(± 


0.10) 


X 


10 5 


2 . 5 


(± 


0. 


05) 


X 


10" 3 


7 . 


1 


X 


10 7 


4D1.4 


2.5 


(± 


0.55) 


X 


10 5 


3 . 4 


(± 


0. 


15) 


X 


10" 3 


7 . 


3 


X 


10 7 


7G2.2 


2 . 4 


(± 


0.31) 


X 


10 5 


3.3 


(± 


0. 


07) 


X 


10" 3 


7. 


3 


X 


10 7 


1F8.3 


1.8 


(± 


0.24) 


X 


10 5 


4.3 


(± 


0. 


14) 


X 


10~ 3 


4. 


3 


X 


10 7 


1G2 . 10 


2.2 


(± 


0.26) 


X 


10 5 


2.3 


(± 


0. 


03) 


X 


IO" 3 


9. 


8 


X 


IO 7 


chi Leu3a 


1.5 


(± 


0.35) 


X 


10 5 


2.3 


(± 


0. 


12) 


X 


IO" 4 


6. 


6 


X 


10 8 



The rate and equilibrium constants presented in 
Table 19 were determined with a BIAcore, using antigen 
(rCD4) coupled to the sensor chip and flowing mAb over. 
These constants were derived from at least three 
independent experiments using purified mAbs. 
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Table 19. Avidity and Rate Constants 



Rate Constants (mean ± SD) 



Human mAb k assoc (M^s" 1 ) ^dissoc (s" 1 ) K a 



1E11.15 


2. 


8 


(± 


0. 


22) 


X 


10 5 


4. 


5 


(± 


0. 


43) 


X 


10- 5 


6 


.2 


X 


10 9 


6C1.10 


2. 


0 


(± 


0. 


25) 


X 


10 5 


4. 


0 


(± 


0. 


63) 


X 


10- 5 


5 


. 1 


X 


10 9 


1G1. 9 


9 . 


1 


(± 


U • 


y o ) 


X 






•5 
Z 


/ 4. 

{X 






A. 


1 0~ 5 


4 


. 2 




10 9 


6G5. 1 


1. 


1 


(± 


0. 


41) 


X 


10 5 


1. 


0 


(± 


0. 


34) 


X 


10- 5 


1 


. 1 


X 


10 10 


10C5. 6 


7. 


4 


(± 


1. 


5) 


X 


10 4 


1. 


6 


(± 


0. 


57) 


X 


10" 5 


4 


.5 


X 


10 9 


2E4.2 


1. 


4 


(± 


0. 


15) 


X 


10 5 


2. 


2 


(± 


0 . 


25) 


X 


10" 5 


6 


. 3 


X 


10 9 


4D1.4 


9. 


8 


(± 


0. 


69) 


X 


10 4 


4. 


2 


(± 


1. 


3) 


X 


10- 5 


2 


. 3 


X 


10 9 


7G2.2 


1. 


7 


(± 


0. 


20) 


X 


10 5 


5. 


0 


(± 


0. 


42) 


X 


10" 5 


3 


.4 


X 


10 9 


1F8.2 


1. 


7 


(± 


0. 


13) 


X 


10 5 


9 . 


7 


(± 


1. 


2) 


X 


10- 5 


1 


.7 


X 


10 9 


1G2 . 10 


1. 


7 


(± 


0. 


04) 


X 


10 5 


6. 


3 


(± 


0. 


49) 


X 


10- 5 


2 


.7 


X 


10 9 


chi Leu3a 


4. 


0 


(± 


0. 


45) 


X 


10 5 


1. 


2 


(± 


0. 


25) 


X 


10- 5 


3 


.4 


X 


10 10 


Leu 3 a 


1. 


5 


(± 


0, 


,30) 


X 


10 5 


4 . 


2 


(± 


0. 


49) 


X 


10~ 6 


3 


.7 


X 


10 10 



Table 2 0 provides equilibrium constants for 
anti-CD4 mABs presented in the scientific literature. 
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Table 20. Avidity and Rate Constants Report ed for Anti- 
CD4 monoclonal antibodies 



Rate Constants (mean ± SD) 



5 



10 



Human mAb 




k dissoc ( s 1 ) 


K a (M- 


X ) 


CE9. 1 (4) 


NR* 


NR 


3.1 x 


10 10 


cMT412 (1) 


NR 


NR 


5.0 x 


10 9 


chi Leu3a (2) 


NR 


NR 


1.0 x 


10 11 


BL4< 3 > 


NR 


NR 


5.5 x 


10 7 


BB14 (3) 


NR 


NR 


3.3 x 


10 8 


cA2< 5 > 


NR 


NR 


1.8 x 


10 9 


CDP571< 6) 


NR 


NR 


7.1 x 


10 9 



15 

* NR = not reported 

(1) J. Cell. Biol. 15E:A179. 

(2) J. Immunol. 145:2839. 

(3) Clin. Immunol. Immunopath. 64:248. 
2 0 (4) Biotechnology. 10: 1455. 

(5) Mol. Immunol. 30:1443. 

(6) European Patent Appl. #0626389A1. 



25 The avidity and affinity determinations 

described above were performed with recombinant CD4 
(rCD4) . To determine the avidity of the human monoclonal 
antibodies for native CD4 (nCD4) . An additional binding 
assay was used that does not require the antibody to be 

3 0 modified. Specifically, serial dilutions of antibody 

were incubated with SupTl cells for 6 hr on ice, washed 
and detected any bound antibody with FITC-goat anti-human 
Fcy. The Ka is determined from the concentration of 
antibody that gives one-half of the maximum fluorescence 

35 (a four parameter fit was used) . The results demonstrate 

that all ten human monoclonal antibodies bind very well 
to nCD4, with Ka values >10 9 M" 1 (Table 21). Most 
antibodies, including chimeric Leu3a, bound less well to 
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nCD4 than to rCD4 . This could be due to differences in 
antigen density as well as to differences between the two 
antigens* 



Table 21. Avidity Constants Determined 
bv Flow Cytometry. 



Ka values (M-l) Ratio of Ka 

Human mAb rCD4 nCD4* (rCD4/nCD4) 

10 



1E11. 15 


6.2 


X 


10 9 


3.3 


X 


10 9 


1.9 


6C1. 10 


5.1 


X 


10 9 


3.1 


X 


10 9 


1.6 


1G1.9 


4.2 


X 


10 9 


2.3 


X 


10 9 


1.9 


6G5.1 


1.1 


X 


10 10 


1.9 


X 


10 9 


5.9 


10C5.6 


4.5 


X 


10 9 


1.8 


X 


10 9 


2.5 


2E4.2 


6.3 


X 


10 9 


1.1 


X 


10 9 


5.8 


4D1.4 


2.3 


X 


10 9 


2 . 0 


X 


10 9 


1.2 


7G2.2 


3.4 


X 


10 9 


3 . 3 


X 


10 9 


1.0 


1F8.2 


1.7 


X 


10 9 


3.2 


X 


10 9 


0.5 


1G2 . 10 


2.7 


X 


10 9 


1.9 


X 


10 9 


1.4 


chi Leu3a 


3.4 


X 


10 10 


5.6 


X 


10 9 


6 . 1 



* Human monoclonal antibodies were incubated in serial dilutions with 
25 SupTl cells for 6 hrs f washed twice and incubated with FITC- 

conjugated goat anti-human Fey antisera, washed and fixed. The Ka 
was calculated from the concentration of antibody yielding one-half 
of the maximum fluorescence as determined from a four-parameter fit. 
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Example 41. 

Identification of Nucleotide Sequences Encoding Human 
IaGkappa Anti-CD4 Antibodies. 



35 



This example demonstrates that a each of the 
hybridomas tested produces only one functional heavy or 
light chain RNA transcript, consistent with proper 



functioning allelic exclusion. In addition, sequence 
analysis of heavy and light chain CDR segments indicates 
that somatic mutation of the immunoglobulin transgenes 
has taken place. 

Cells from five individual hybridoma cell lines 
(1E11, 1G2, 6G5, 10C5, and 4D1) that secrete human IgG 
kappa monoclonal antibodies reactive with human CD4, and 
derived from JHD / JCKD / HC 2 / KC o 5 transgenic mice, were used 
to isolate RNA encoding each of the individual antibodies 
(Fishwild et al. 1996, Nature Biotechnology 14, 845-851). 
The RNA was used as a substrate to synthesize cDNA, which 
was then used to amplify human Ig gamma and kappa 
transcript sequences by PCR using primers specific for 
human VH, Vkappa, Cgamma, and Ckappa (Taylor et al. 1992, 
Nucleic Acids Res. 20, 6287-6295; Larrick, J.W., et al. 
(1989), Bio/Technology. 7. 934-938; Marks, J.D., et al. 
(1991). Eur. *7. Immunol. 21. 985-991; Taylor, et al., 
1994, Int. Immunol. 6, 579-591). The amplified Ig heavy 
and kappa light chain sequences were cloned into 
bacterial plasmids and nucleotide sequences determined. 
Analysis of the sequences spanning the heavy chain VDJ 
and light chain VJ junctions revealed in-frame heavy and 
light chain transcripts for each of the 5 clones, and in 
some cases additional out-of-frame sterile transcripts 
representing non-functional alleles. Consistent with 
proper functioning allelic exclusion, in no case was 
there more than one unique functional heavy or light 
chain transcript identified for each of the individual 
clones. Partial nucleotide sequences for each of the ten 
functional transcripts are assigned the following 
sequence I.D. No's: 1E11 gamma [Seq. I.D. No. 1]; 1E11 
kappa [Seq. I.D. No. 2]; 1G2 gamma [Seq. I.D. No. 3]; 1G2 
kappa [Seq. I.D. No. 4]; 6G5 gamma [Seq. I.D. No. 5]; 6G5 
kappa [ Seq . I.D. No . 6 ] ; 10C5 gamma [ Seq . I.D. No . 7 ] ; 
IOCS kappa [Seq. I.D. No. 8]; 4D1 gamma [Seq. I.D. No. 
9]; 4D1 kappa [Seq. I.D. No. 10] and are presented in 
Table 22. All sequences are presented in a 5 1 to 3 1 
orientation. 
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Table 22. Partial Nucleotide Sequence for 
Functional Transcripts 

1E11 gamma [Seq. I.D. No. 1] 

TGCACAAGAACATGAAACACCTGTGGTTCTTCCTCCTCCTGGTGGCAGCTCCCAGAT 
GGGTCCTGTCCCAGGTGCAGCTTCATCAGTGGGGCGCAGGACTGTTGAAGCCTTCGG 
AGACCCTGTCCCTCACCTGCGCTGTCTATGGTGGGTCCTTCAGTGGTTACTTCTGGA 
GCTGGATCCGCCAGCCCCCAGGGAGGGGGCTGGAGTGGATTGGGGAAATCCATCATC 
GTGGAAGCACCAACTACAACCCGTCCCTCGAGAGTCGAGTCACCCTATCAGTAGACA 
CGTCCAAAAACCAGTTCTCCCTGAGGCTGAGTTCTGTGACCGCCGCGGACACGGCTG 
TGTATTACTGTGCGAGAGACATTACTATGGTTCGGGGAGTACCTCACTGGGGCCAGG 
GAACCCTGGTCACC 

1E11 kappa [Seq. I.D. No. 2] 

GACAGACTTCACTCTCACCATCAGCAGACTGGAGCCTGAAGATTTTGCAGTGTATTA 
CTGTCAGCAGTATGGTAGCTCACCCCTCACTTTCGGCGGAGGGACCAAGGTGGAGAT 
CAAACGAACTGTGGCGGCACCATCTGTCTTCATCTTCCC 

1G2 gamma [ Seq . I.D. No . 3 ] 

TCCACCATCATGGGGTCAACCGCCATCCTCGCCCTCCTCCTGGCTGTTCTCCAAGGA 
GTCTGTGCCGAGGTGCAGCTGGTGCAGTCTGGAGCAGAGGTGAAAAAGCCCGGGGAG 
TCTCTGAAGATCTCCTGTAAGGGTTCTGGATACAGCTTTACCAGTTACTGGATCGCC 
TGGGTGCGCCAGATGCCCGGGAAAGGCCTGGAGTGGATGGGGATCATCGATCCTGCT 
GACTCTGATACCAGATACAACCCGTCCTTCCAAGGCCAGGTCACCATCTCAGCCGAC 
AAGTCCATCAGTACCGCCTATTTGCAGTGGAGCAGCCTGAAGGCCTCGGACACCGCC 
ATGTATTACTGTGCGAGACCAGCGAACTGGAACTGGTACTTCGTTCTCTGGGGCCGT 
GGCAC C CTGGT CACT 

1G2 kappa [Seq. I.D. No. 4] 

GACAGATTTCACTCTCACCATCAGCAGCCTGCAGCCTGAAGATTTTGCAACTTATTA 
CTGTCAACAGTTTATTAGTTACCCTCAGCTCACTTTCGGCGGAGGGACCAGGGTGGA 
GATCAAACGAACTGTGGCTGCACCATCTGTCTTCATCTTCCC 

6G5 gamma [Seq. I.D. No. 5] 

TGCACAAGAACATGAAACACCTGTGGTTCTTCCTCCTCCTGGTGGCAGCTCCCAGAT 
GGGTCCTGTCCCAGGTGCAGCTACAGCAGTGGGGCGCAGGACTGTTGAAGCCTTCGG 
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AGACCCTGTCCCTCACCTGCGCTGTCTATGGTGGGTCCTTCAGTGGTTACTACTGGA 
GCTGGATCCGCCAGCCCCCAGGTAAGGGGCTGGAGTGGATTGGGGAAATCAATCATA 
GTGGAAGCACCAACTACAACCCGTCCCTCAAGAGTCGAGTCACCATATCAGTCGACA 
CGTCCAAGAACCAGTTCTCCCTGAAACTGAGCTCTGTGACCGCCGCGGACACGGCTG 
5 TGTATTACTGTGCGAGAGTAATTAATTGGTTCGACCCCTGGGGCCAGGGAACCCTGG 
TCACC 

6G5 kappa [Seq. I.D. No. 6] 

GACAGATTTCACTCTCACCATCAGCAGCCTGCAGCCTGAAGATTTTGCAACTTACTA 
10 TTGTCAACAGGCTAATAGTTTCCCGTACACTTTTGGCCAGGGGACCAAGCTGGAGAT 
CAAACGAACTGTGGCTGCACCATCTGTCTTCATCTTCCC 

IOCS gamma [Seq. I.D. No. 7] 

ATGAAACACCTGTGGTTCTTCCTCCTCCTGGTGGCAGCTCCCAGATGGGTCCTGTCC 
15 CAGGTGCAGCTACAGCAGTGGGGCGCAGGACTGTTGAAGCCTTCGGAGACCCTGTCC 
CTCACCTGCGCTGTCTATGGTGGGTCCTTCAGTGGTTACTACTGGAGCTGGATCCGC 
CAGCCCCCAGGTAAGGGGCTGGAGTGGATTGGGGAAATCAATCATAGTGGAAGCACC 
AACTACAACCCGTCCCTCAAGAGTCGAGTCACCATATCAGTCGACACGTCCAAGAAC 
CAGTTCTCCCTGAAGCTGAGCTCTGTGACCGCCGCGGACACGGCTGTGTATTACTGT 

2 0 GCGAGAGTAATTAATTGGTTCGACCCCTGGGGCCAGGGAACCCTGGTCACCGTCTCC 

TCAG 

10C5 kappa [Seq. I.D. No. 8] 

ATGGACATGATGGTCCCCGCTCAGCTCCTGGGGCTCCTGCTGCTCTGGTTCCCAGGT 
25 TCCAGATGCGACATCCAGATGACCCAGTCTCCATCTTCCGTGTCTGCATCTGTAGGA 
GACAGAGTCACCATCACTTGTCGGGCGAGTCAGGATATTAGCAGCTGGTTAGCCTGG 
TATCAGCATAAACCAGGGAAAGCCCCTAAGCTCCTGATCTATGCTGCATCCAGTTTG 
CAAAGTGGGGTCCCATCAAGGTTCAGCGGCAGTGGATCTGGGACAGATTTCACTCTC 
ACCATCAGCAGCCTGCAGCCTGAAGATTTTGCAACTTACTATTGTCAACAGGCTAAT 

3 0 AGTTTCCCGTACACTTTTGGCCAGGGGACCAAGCTGGAGATCAAAC 

4D1 gamma [ Seq . I.D. No . 9 ] 

ATGGGGTCAACCGCCATCCTCGCCCTCCTCCTGGCTGTTCTCCAAGGAGTCTGTGCC 
GAGGTGCAGCTGGTGCAGTCTGGAGCAGAGGTGAAAAAGCCCGGGGAGTCTCTGAAG 
35 ATCTCCTGTAAGGGTTCTGGATACAGCTTTACCGGCTACTGGATCGGCTGGGTGCGC 
CAGATGCCCGGGAAAGGCCTGGAGTGGATGGGGATCATCTATCCTGGTGACTCTGAT 
ACCACATACAGCCCGTCCTTCCAAGGCCAGGTCACCATCTCAGCCGACAAGTCCATC 
AGCACCGCCTACCTGCAGTGGAGCAGCCTGAAGGCCTCGGACACCGCCATGTATTAC 
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TGTGCGAGAGACCAACTGGGCCTCTTTGACTACTGGGGCCAGGGAACCCTGGTCACC 
GTCTCCTCAGCCTCCACCAAGGGCCCATCGGTCTTCCCCCTGGCACCCTCCTCCAAG 
AAGCTT 

5 4D1 kappa [Seq. I.D. No. 10] 

ATGGACATGGAGTTCCCCGTTCAGCTCCTGGGGCTCCTGCTGCTCTGTTTCCCAGGT 
GCCAGATGTGACATCCAGATGACCCAGTCTCCATCCTCACTGTCTGCATCTGTAGGA 
GACAGAGTCACCATCACTTGTCGGGCGAGTCAGGGTATTAGCAGCTGGTTAGCCTGG 
TATCAGCAGAAACCAGAGAAAGCCCCTAAGTCCCTGATCTATTCTGCATCCAGTTTG 
10 CAAAGTGGGGTCCCATCAAGGTTCAGCGGCAGTGGATCTGGGACAGATTTCACTCTC 
ACCATCAGCAGCCTGCAGCCTGAAGATTTTGCAACTTATTACTGCCAACAGTATGAT 
AGTTACCCGTACACTTTTGGCCAGGGGACCAAGCTGGAGATCAAACGAACTGTGGCT 
GCACCATCTGTCTTCATCTTCCCGCCATCTGATGAAGCTT 



Analysis of these DNA sequences demonstrates 
that the 5 hybridoma clones represent descendants of 4 
individual primary B cells. Table 23 shows the amino 
acid sequences derived for each of the ten CDR3 regions, 

20 and the assignments for germline gene segments 

incorporated into each of the genes encoding these 
transcripts. The germline assignments are based on 
published gene sequences available from the National 
Center for Biotechnology Information, National Library of 

25 Medicine, National Institutes of Health, Bethesda, Md. 

Also see: Cook et al. 1994, Nature Genet. 7, 162-168; 
Tomlinson et al. 1992, J. Mol. Biol. 227, 776-798; 
Matsuda et al. 1993, Nature Genet. 3, 88-94; Schable and 
Zachau, 1993, Biol. Chem. Hoppe-Seyler 374, 1001-1022; 

30 Cox et al. 1994, Eur. J. Immunol. 24, 827-836; Ravetch et 

al. 1981, Cell 27, 583-591; Ichihara et al. 1988, EMBO J. 
7, 4141-4150; Yamada et al. 1991, J. Exp. Med. 173, 
395-407; Sanz , 1991, J. Immunol. 147, 1720-1729. 

35 Table 23. Germline V(D)J Segment Usage in Hybridoma 

Transcripts. 



clone 


h . c . CDR3 


VH 


DH 


JH 


I.e. CDR3 


Vk 


Jk 
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1E11 


DITMVRGVPH 
(Seq. I.D. 
No. 63) 


VH4-34 


DXP' 1 


JH4 


QQYGSSPLT 
(Seq. I.D. 
No. 67) 


VkA27/All 


Jk4 


1G2 


PANWNWYFVL 
(Seq. I.D. 
No. 64) 


VH5-51 


DHQ52 


JH2 


QQFISYPQLT 
(Seq. I.D. 
No. 68) 


VkL18 


Jk4 


6G5 


VINWFDP 
(Seq. I.D. 
No. 65) 


VH4-34 


n.d. 


JH5 


QQANSFPYT 
(Seq. I.D. 
No. 69) 


VkL19 


Jk2 


IOCS 


VINWFDP 


VH4-34 


n.d. 


JH5 


QQANSFPYT 


VkL19 


Jk2 


4D1 


DQLGLFDY 
(Seq. I.D. 
No. 66) 


VH5-51 


DHQ52 


JH4 


QQYDSYPYT 
(Seq. I.D. 
No. 70) 


VkL15 


Jk2 



n.d. could not be determined from nucleotide 
sequence . 



Example 42. 

15 Construction of Minicrenes for Expression of Human 

IcrGkappa AntiCD4 Antibodies in Transfected Cell lines. 

This example demonstrates the process of making 
a wholly artificial gene that encodes an immunoglobulin 
polypeptide (i.e., an immunoglobulin heavy chain or light 

2 0 chain) . Plasmids were constructed so that PCR amplified 

V heavy and V light chain cDNA sequences could be used to 
reconstruct complete heavy and light chain minigenes. 

The kappa light chain plasmid, pCK7-96, 
includes the kappa constant region and polyadenylation 
25 site [Seq. ID No. 11], such that kappa sequences 

amplified with 5' primers that include Hindlll sites 
upstream of the initiator methionine can be digested with 
Hindlll and Bbsl, and cloned into pCK7-96 digested with 
Hindu I and Bbsl to reconstruct a complete light chain 

3 0 coding sequence together with a polyadenylation site. 
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This cassette can be isolated as a Hindlll/NotI fragment 
and ligated to transcription promoter sequences to create 
a functional minigene for transfection into cells. 

The gammal heavy chain plasmid, pCG7-9 6, 
includes the human gammal constant region and 
polyadenylation site [Seq. ID No. 12], such that gamma 
sequences amplified with 5' primers that include Hindlll 
sites upstream of the initiator methionine can be 
digested with Hindlll and Agel, and cloned into pCG7-96 
digested with Hindlll and Agel to reconstruct a complete 
gammal heavy chain coding sequence together with a 
polyadenylation site. This cassette can be isolated as a 
Hindlll /Sail fragment and ligated to transcription 
promoter sequences to create a functional minigene for 
transfection into cells. 

The following example demonstrates how 
nucleotide sequence data from hybridomas can be used to 
reconstruct functional Ig heavy and light chain 
minigenes. The nucleotide sequences of heavy and light 
chain transcripts from hybridomas 6G5 and 10C5 were used 
to design an overlapping set of synthetic 
oligonucleotides to create synthetic V sequences with 
identical amino acid coding capacities as the natural 
sequences. The synthetic heavy and kappa light chain 
sequences (designated HC6G5 [Seq. I.D. No. 61] and LC6G5 
[Seq. I.D. No. 62] differed from the natural sequences in 
three ways: strings of repeated nucleotide bases were 
interrupted to facilitate oligonucleotide synthesis and 
PCR amplification; optimal translation initiation sites 
were incorporated according to Kozak's rules (Kozak, 
1991, J. Biol. Chem. 266, 19867-19870); and, Hindlll 
sites were engineered upstream of the translation 
initiation sites. 
A. Synthetic kappa light chain. 

Light chain PCR reaction 1. 

The following oligonucleotides were pooled: o- 
548 [Seq. I.D. No. 13], o-549 [Seq. I.D. No. 14], O-550 
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[Seq. I.D. No* 15], o-551 [Seq. I.D. No, 16], o-552 [Seq. 
I.D. No. 17], o-563 [Seq. I.D. No. 18], o-564 [Seq. I.D. 
No. 19], o-565 [Seq. I.D. No. 20], o-566 [Seq. I.D. No. 
21], o-567 [Seq. I.D. No. 22], and amplified with the 
5 following 2 primers: o-527 [Seq. I.D. No. 23] and o-562 

[Seq. I.D. No. 24] . 

Light chain PGR reaction 2 . 

The following oligonucleotides were pooled: o- 
10 553 [Seq. I.D. No. 25], o-554 [Seq. I.D. No. 26], o-555 

[Seq. I.D. No. 27], o-556 [Seq. I.D. No. 28], o-557 [Seq. 

I.D. No. 29], o-558 [Seq. I.D. No. 30], o-559 [Seq. I.D. 

No. 31], O-560 [Seq. I.D. No. 32], o-561 [Seq. I.D. No. 

33], o-562 [Seq. I.D. No. 24], and amplified with the 
15 following 2 primers: o-552 [Seq. I.D. No. 17] and o-493 

[Seq. I.D. No. 34] . 

Light chain PCR reaction 3 . 

The products of light chain PCR reactions 1 and 
20 2 were then combined and amplified with the following two 

primers: o-493 [Seq. I.D. No. 34] and o-527 [Seq. I.D. 
No. 23]. 

The product of light chain PCR reaction 3 was 
then digested with Hindlll and Bbsl and cloned into 
25 Hindlll/Bbsl digested pCK7-96 [Seq. I.D. No. 11] to 

generate pLC6G5 [Seq. I.D. No. 35]. 

B. Synthetic gamma heavy chain. 
3 0 Heavy chain PCR reaction 1. 

The following oligonucleotides were pooled: o- 

528 [Seq. I.D. No. 36], o-529 [Seq. I.D. No. 37], O-530 

[Seq. I.D. No. 38] , o-531 [Seq. I.D. No. 39] , o-532 [Seq. 

I.D. No. 40], o-543 [Seq. I.D. No. 41], o-544 [Seq. I.D. 
35 No. 42], o-545 [Seq. I.D. No. 43], o-546 [Seq. I.D. No. 

44], o-547 [Seq. I.D. No. 45], and amplified with the 

following 2 primers: o-496 [Seq. I.D. No. 46] and o-542 

[Seq. I.D. No. 47] . 
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Heavy chain PCR reaction 2. 

The following oligonucleotides were pooled: o- 
533 [Seq. I.D. No* 48] , o-534 [Seq. I.D. No. 49], o-535 
[Seq. I.D. No. 50] , o-53 6 [Seq. I.D. No. 51] , o-53 7 [Seq. 
5 I.D. No. 52], o-538 [Seq. I.D. No. 56], o-539 [Seq. I.D. 

No. 53], O-540 [Seq. I.D. No. 54], o-541 [Seq. I.D. No. 
55], o-542 [Seq. I.D. No. 47], together with the isolated 
439 bp Bbsl fragment of pCG7-96 [Seq. I.D. No. 12] and 
amplified with the following 2 primers: o-490 [Seq. I.D. 
10 No. 57] and O-520 [Seq. I.D. No. 58]. 

Heavy chain PCR reaction 3 . 

The products of heavy chain reactions 1 and 2 
were then combined and amplified with the following two 
15 primers: o«52 0 [Seq. I.D. No. 58] and o-521 [Seq. I.D. 

No. 59]. 

The product of heavy chain reaction 3 was then 
digested with Hindlll and Agel and cloned into 
Hindlll/Agel digested pCG7-96 [Seq. I.D. No. 12] to 

2 0 generate pHC6G5 [Seq. I.D. No. 60]. 

Table 24. Primers , Vectors and Products 
Used in Miniaene Construction 

25 pCK7-96 [Seq. I.D. No. 11] 

TCTTCCGCTTCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCG 
GTATCAGCTCACTCAAAGGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCA 
GGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCG 
TTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAATCGACGC 

3 0 TCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCT 

GGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCC 
GCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCTCACGCTGTAGGTATCTC 
AGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAG 
CCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACAC 
3 5 GACTTATCGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTA 
GGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACA 
GTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGC 
TCTTGATCCGGCAAACAAACCACCGCTGGTAGCGGTGGTTTTTTTGTTTGCAAGCAG 
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CAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGATCTTTTCTACGGGG 
TCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCA 
AAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAA 
AGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCACCT 
5 ATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAG 
ATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGA 
GACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCC 
GAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGC 
CGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATT 

10 GCTACAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGT 
TCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGC 
TCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATG 
GTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCT 
GTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGT 

15 TGCTCTTGCCCGGCGTCAATACGGGATAATACCGCGCCACATAGCAGAACTTTAAAA 
GTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTG 
TTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTT 
ACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAG 
GGAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTAT 

2 0 TGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAG 

AAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGTC 
TAAGAAACCATTATTATCATGACATTAACCTATAAAAATAGGCGTATCACGAGGCCC 
TTTCGTCTCGCGCGTTTCGGTGATGACGGTGAAAACCTCTGACACATGCAGCTCCCG 
GAGACGGTCACAGCTTGTCTGTAAGCGGATGCCGGGAGCAGACAAGCCCGTCAGGGC 
25 GCGTCAGCGGGTGTTGGCGGGTGTCGGGGCTGGCTTAACTATGCGGCATCAGAGCAG 
ATTGTACTGAGAGTGCACCATATGCGGTGTGAAATACCGCACAGATGCGTAAGGAGA 
AAATACCGCATCAGGCGCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGA 
TCGGTGCGGGCCTCTTCGCTATTACGCCAGCTGGCGAAAGGGGGATGTGCTGCAAGG 
CGATTAAGTTGGGTAACGCCAGGGTTTTCCCAGTCACGACGTTGTAAAACGACGGCC 

3 0 AGTGCCAAGCTAGCGGCCGCGGTCCAACCACCAATCTCAAAGCTTGGTACCCGGGAG 

CCTGTTATCCCAGCACAGTCCTGGAAGAGGCACAGGGGAAATAAAAGCGGACGGAGG 
CTTTCCTTGACTCAGCCGCTGCCTGGTCTTCTTCAGACCTGTTCTGAATTCTAAACT 
CTGAGGGGGTCGGATGACGTGGCCATTCTTTGCCTAAAGCATTGAGTTTACTGCAAG 
GTCAGAAAAGCATGCAAAGCCCTCAGAATGGCTGCAAAGAGCTCCAACAAAACAATT 
35 TAGAACTTTATTAAGGAATAGGGGGAAGCTAGGAAGAAACTCAAAACATCAAGATTT 
TAAATACGCTTCTTGGTCTCCTTGCTATAATTATCTGGGATAAGCATGCTGTTTTCT 
GTCTGTCCCTAACATGCCCTGTGATTATCCGCAAACAACACACCCAAGGGCAGAACT 
TTGTTACTTAAACACCATCCTGTTTGCTTCTTTCCTCAGGAACTGTGGCTGCACCAT 
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CTGTCTTCATCTTCCCGCCATCTGATGAGCAGTTGAAATCTGGAACTGCCTCTGTTG 
TGTGCCTGCTGAATAACTTCTATCCCAGAGAGGCCAAAGTACAGTGGAAGGTGGATA 
ACGCCCTCCAATCGGGTAACTCCCAGGAGAGTGTCACAGAGCAGGACAGCAAGGACA 
GCACCTACAGCCTCAGCAGCACCCTGACGCTGAGCAAAGCAGACTACGAGAAACACA 
5 AAGTCTACGCCTGCGAAGTCACCCATCAGGGCCTGAGCTCGCCCGTCACAAAGAGCT 
TCAACAGGGGAGAGTGTTAGAGGGAGAAGTGCCCCCACCTGCTCCTCAGTTCCAGCC 
TGACCCCCTCCCATCCTTTGGCCTCTGACCCTTTTTCCACAGGGGACCTACCCCTAT 
TGCGGTCCTCCAGCTCATCTTTCACCTCACCCCCCTCCTCCTCCTTGGCTTTAATTA 
TGCTAATGTTGGAGGAGAATGAATAAATAAAGTGAATCTTTGCACCTGTGGTTTCTC 

10 TCTTTCCTCAATTTAATAATTATTATCTGTTGTTTACCAACTACTCAATTTCTCTTA 
TAAGGGACTAAATATGTAGTCATCCTAAGGCGCATAACCATTTATAAAAATCATCCT 
TCATTCTATTTTACCCTATCATCCTCTGCAAGACAGTCCTCCCTCAAACCCACAAGC 
CTTCTGTCCTCACAGTCCCCTGGGCCATGGATCCTCACATCCCAATCCGCGGCCGCA 
ATTCGTAATCATGGTCATAGCTGTTTCCTGTGTGAAATTGTTATCCGCTCACAATTC 

15 CACACAACATACGAGCCGGAAGCATAAAGTGTAAAGCCTGGGGTGCCTAATGAGTGA 
GCTAACTCACATTAATTGCGTTGCGCTCACTGCCCGCTTTCCAGTCGGGAAACCTGT 
CGTGCCAGCTGCATTAATGAATCGGCCAACGCGCGGGGAGAGGCGGTTTGCGTATTG 
GGCGC 

20 pCG7-96 [Seq. I.D. No. 12] 

GAACTCGAGCAGCTGAAGCTTTCTGGGGCAGGCCAGGCCTGACCTTGGCTTTGGGGC 
AGGGAGGGGGCTAAGGTGAGGCAGGTGGCGCCAGCCAGGTGCACACCCAATGCCCAT 
GAGCCCAGACACTGGACGCTGAACCTCGCGGACAGTTAAGAACCCAGGGGCCTCTGC 
GCCCTGGGCCCAGCTCTGTCCCACACCGCGGTCACATGGCACCACCTCTCTTGCAGC 

25 CTCCACCAAGGGCCCATCGGTCTTCCCCCTGGCACCCTCCTCCAAGAGCACCTCTGG 
GGGCACAGCGGCCCTGGGCTGCCTGGTCAAGGACTACTTCCCCGAACCGGTGACGGT 
GTCGTGGAACTCAGGCGCCCTGACCAGCGGCGTGCACACCTTCCCGGCTGTCCTACA 
GTCCTCAGGACTCTACTCCCTCAGCAGCGTGGTGACCGTGCCCTCCAGCAGCTT 
GGGCACCCAGACCTACATCTGCAACGTGAATCACAAGCCCAGCAACACCAAGGTGGA 

30 CAAGAAAGTTGGTGAGAGGCCAGCACAGGGAGGGAGGGTGTCTGCTGGAAGCCAGGC 
TCAGCGCTCCTGCCTGGACGCATCCCGGCTATGCAGCCCCAGTCCAGGGCAGCAAGG 
CAGGCCCCGTCTGCCTCTTCACCCGGAGGCCTCTGCCCGCCCCACTCATGCTCAGGG 
AGAGGGTCTTCTGGCTTTTTCCCCAGGCTCTGGGCAGGCACAGGCTAGGTGCCCCTA 
ACCCAGGCCCTGCACACAAAGGGGCAGGTGCTGGGCTCAGACCTGCCAAGAGCCATA 

3 5 TCCGGGAGGACCCTGCCCCTGACCTAAGCCCACCCCAAAGGCCAAACTCTCCACTCC 
CTCAGCTCGGACACCTTCTCTCCTCCCAGATTCCAGTAACTCCCAATCTTCTCTCTG 
CAGAGCCCAAATCTTGTGACAAAACTCACACATGCCCACCGTGCCCAGGTAAGCCAG 
CCCAGGCCTCGCCCTCCAGCTCAAGGCGGGACAGGTGCCCTAGAGTAGCCTGCATCC 
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AGGGACAGGCCCCAGCCGGGTGCTGACACGTCCACCTCCATCTCTTCCTCAGCACCT 
GAACTCCTGGGGGGACCGTCAGTCTTCCTCTTCCCCCCAAAACCCAAGGACACCCTC 
ATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGAGCCACGAAGAC 
CCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCATAATGCCAAGACA 
5 AAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTGTGGTCAGCGTCCTCACCGTC 
CTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAGCC 
CTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGTGGGACCCGTGGGGTG 
CGAGGGCCACATGGACAGAGGCCGGCTCGGCCCACCCTCTGCCCTGAGAGTGACCGC 
TGTACCAACCTCTGTCCCTACAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCC 

10 CCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGG 
CTTCTATCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAA 
CTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCTACAGCAA 
GCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCGTGAT 
GCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTAA 

15 ATGAGTGCGACGGCCGGCAAGCCCCCGCTCCCCGGGCTCTCGCGGTCGCACGAGGAT 
GCTTGGCACGTACCCCCTGTACATACTTCCCGGGCGCCCAGCATGGAAATAAAGCAC 
CCAGCGCTGCCCTGGGCCCCTGCGAGACTGTGATGGTTCTTTCCACGGGTCAGGCCG 
AGTCTGAGGCCTGAGTGGCATGAGGGAGGCAGAGCGGGTCCCACTGTCCCCACACTG 
GCCCAGGCTGTGCAGGTGTGCCTGGGCCCCCTAGGGTGGGGCTCAGCCAGGGGCTGC 

2 0 CCTCGGCAGGGTGGGGGATTTGCCAGCGTGGCCCTCCCTCCAGCAGCACCTGCCCTG 
GGCTGGGCCACGGGAAGCCCTAGGAGCCCCTGGGGACAGACACACAGCCCCTGCCTC 
TGTAGGAGACTGTCCTGTTCTGTGAGCGCCCCTGTCCTCCCGACCTCCATGCCCACT 
CGGGGGCATGCCTGCAGGTCGACTCTAGAGGATCCCCGGGTACCGAGCTCGAATTCA 
TCGATGATATCAGATCTGCCGGTCTCCCTATAGTGAGTCGTATTAATTTCGATAAGC 

2 5 CAGGTTAACCTGCATTAATGAATCGGCCAACGCGCGGGGAGAGGCGGTTTGCGTATT 

GGGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGG 
CGAGCGGTATCAGCTCACTCAAAGGCGGTAATACGGTTATCCACAGAATCAGGGGAT 
AACGCAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAG 
GCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAAT 

3 0 CGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTT 

CCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATAC 
CTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCAATGCTCACGCTGTAGG 
TATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCC 
GTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTA 
35 AGACACGACTTATCGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGG 
TATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGA 
AGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTT 
GGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAGCGGTGGTTTTTTTGTTTGC 
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AAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGATCTTTTCT 
ACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGA 
TTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCA 
ATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAG 
5 GCACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTC 
GTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATA 
CCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGA 
AGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAAT 
TGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTT 

10 GCCATTGCTACAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGC 
TCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCG 
GTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCA 
CTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGC 
TTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGA 

15 CCGAGTTGCTCTTGCCCGGCGTCAATACGGGATAATACCGCGCCACATAGCAGAACT 
TTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTA 
CCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCA 
TCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCA 
AAAAAGGGAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAA 

2 0 TATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGT 

ATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCT 
GACGTCTAAGAAACCATTATTATCATGACATTAACCTATAAAAATAGGCGTATCACG 
AGGCCCTTTCGTCTCGCGCGTTTCGGTGATGACGGTGAAAACCTCTGACACATGCAG 
CTCCCGGAGACGGTCACAGCTTGTCTGTAAGCGGATGCCGGGAGCAGACAAGCCCGT 
25 CAGGGCGCGTCAGCGGGTGTTGGCGGGTGTCGGGGCTGGCTTAACTATGCGGCATCA 
GAGCAGATTGTACTGAGAGTGCACCATATGGACATATTGTCGTTAGAACGCGGCTAC 
AATTAATACATAACCTTATGTATCATACACATACGATTTAGGTGACACTATA 

0-548 [Seq. I.D. No. 13 ] 

3 0 ATGGTCCCAGCTCAGCTCCTCGGTCTCCTGCTGCTCTGGTTCCC 

0-549 [Seq. I.D. No. 14] 

AGGTTCCAGATGCGACATCCAGATGACCCAGTCTCCATCTTCCG 

35 O-550 [Seq. I.D. No. 15] 

TGTCTGCATCTGTAGGAGACAGAGTCACCATCACTTGTCGGGCG 

0-551 [Seq. I.D. No. 16] 



265 



AGTCAGGATATTAGCAGCTGGTTAGCCTGGTATCAGCATAAACC 
0-552 [Seq. I.D. No. 17] 

AGGTAAAGCACCTAAGCTCCTGATCTATGCTGCATCCAGTTTGC 

5 

0-563 [Seq. I.D. No. 18] 

AGGAGCTTAGGTGCTTTACCTGGTTTATGCTGATACCAGGCTAA 

0-564 [Seq. I.D. No. 19] 
10 CCAGCTGCTAATATCCTGACTCGCCCGACAAGTGATGGTGACTC 

0-565 [Seq. I.D. No. 20] 

TGTCTCCTACAGATGCAGACACGGAAGATGGAGACTGGGTCATC 

15 0-566 [Seq. I.D. No. 21] 

TGGATGTCGCATCTGGAACCTGGGAACCAGAGCAGCAGGAGACC 

0-567 [Seq. I.D. No. 22] 

GAGGAGCTGAGCTGGGACCATCATGGTGGCAAGCTTAGAGTC 

20 

0-527 [Seq. I.D. No. 23] 
GACTCTAAGCTTGCCACCATGATGGTCC 

0-562 [Seq. I.D. No. 24] 
25 ACCTTGATGGGACACCACTTTGCAAACTGGATGCAGCATAGATC 

0-553 [Seq. I.D. No. 25] 

AAAGTGGTGTCCCATCAAGGTTCAGCGGAAGTGGATCTGGGACA 

30 0-554 [Seq. I.D. No. 26] 

GATTTCACTCTCACCATCAGCAGCCTGCAGCCTGAAGATTTTGC 

0-555 [Seq. I.D. No. 27] 

AACTTACTATTGTCAACAGGCTAATAGTTTCCCGTACACTTTTG 

35 

0-556 [Seq. I.D. No. 28] 

GTCAGGGAACCAAGCTGGAGATCAAACGAACTGTGGCTGCACCA 
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0-557 [ Seq . I . D . No . 2 9] 

TCTGTCTTCATCTTCCCGCCATCTGATGAGCAGTTGA 

0-5 58 [ Seq ♦ I . D . No . 3 0] 
5 GGGAAGATGAAGACAGATGGTGCAGCCACAGTTCGTTTGA 

0-559 [Seq. I.D. No. 31] 

TCTCCAGCTTGGTTCCCTGACCAAAAGTGTACGGGAAACTATTA 

10 O-560 [Seq. I.D. No. 32] 

GCCTGTTGACAATAGTAAGTTGCAAAATCTTCAGGCTGCAGGCT 

0-561 [Seq. I.D. No. 33] 

GCTGATGGTGAGAGTGAAATCTGTCCCAGATCCACTTCCGCTGA 

15 

0-493 [Seq. I.D. No. 34] 
TCAACTGCTCATCAGATGGC 

pLC6G5 [Seq. I.D. No. 35] 

2 0 TCTTCCGCTTCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCG 

GTATCAGCTCACTCAAAGGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCA 
GGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCG 
TTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAATCGACGC 
TCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCT 
25 GGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCC 
GCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCTCACGCTGTAGGTATCTC 
AGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAG 
CCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACAC 
GACTTATCGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTA 

3 0 GGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACA 

GTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGC 
TCTTGATCCGGCAAACAAACCACCGCTGGTAGCGGTGGTTTTTTTGTTTGCAAGCAG 
CAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGATCTTTTCTACGGGG 
TCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCA 
3 5 AAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAA 
AGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCACCT 
ATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAG 
ATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGA 
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GACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCC 
GAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGC 
CGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATT 
GCTACAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGT 
5 TCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGC 
TCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATG 
GTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCT 
GTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGT 
TGCTCTTGCCCGGCGTCAATACGGGATAATACCGCGCCACATAGCAGAACTTTAAAA 

10 GTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTG 
TTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTT 
ACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAG 
GGAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTAT 
TGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAG 

15 AAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGTC 
TAAGAAACCATTATTATCATGACATTAACCTATAAAAATAGGCGTATCACGAGGCCC 
TTTCGTCTCGCGCGTTTCGGTGATGACGGTGAAAACCTCTGACACATGCAGCTCCCG 
GAGACGGTCACAGCTTGTCTGTAAGCGGATGCCGGGAGCAGACAAGCCCGTCAGGGC 
GCGTCAGCGGGTGTTGGCGGGTGTCGGGGCTGGCTTAACTATGCGGCATCAGAGCAG 

20 ATTGTACTGAGAGTGCACCATATGCGGTGTGAAATACCGCACAGATGCGTAAGGAGA 
AAATACCGCATCAGGCGCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGA 
TCGGTGCGGGCCTCTTCGCTATTACGCCAGCTGGCGAAAGGGGGATGTGCTGCAAGG 
CGATTAAGTTGGGTAACGCCAGGGTTTTCCCAGTCACGACGTTGTAAAACGACGGCC 
AGTGCCAAGCTAGCGGCCGCGGTCCAACCACCAATCTCAAAGCTTGCCACCATGATG 

2 5 GTCCCAGCTCAGCTCCTCGGTCTCCTGCTGCTCTGGTTCCCAGGTTCCAGATGCGAC 
ATCCAGATGACCCAGTCTCCATCTTCCGTGTCTGCATCTGTAGGAGACAGAGTCACC 
ATCACTTGTCGGGCGAGTCAGGATATTAGCAGCTGGTTAGCCTGGTATCAGCATAAA 
CCAGGTAAAGCACCTAAGCTCCTGATCTATGCTGCATCCAGTTTGCAAAGTGGTGTC 
CCATCAAGGTTCAGCGGAAGTGGATCTGGGACAGATTTCACTCTCACCATCAGCAGC 

30 CTGCAGCCTGAAGATTTTGCAACTTACTATTGTCAACAGGCTAATAGTTTCCCGTAC 
ACTTTTGGTCAGGGAACCAAGCTGGAGATCAAACGAACTGTGGCTGCACCATCTGTC 
TTCATCTTCCCGCCATCTGATGAGCAGTTGAAATCTGGAACTGCCTCTGTTGTGTGC 
CTGCTGAATAACTTCTATCCCAGAGAGGCCAAAGTACAGTGGAAGGTGGATAACGCC 
CTCCAATCGGGTAACTCCCAGGAGAGTGTCACAGAGCAGGACAGCAAGGACAGCACC 

35 TACAGCCTCAGCAGCACCCTGACGCTGAGCAAAGCAGACTACGAGAAACACAAAGTC 
TACGCCTGCGAAGTCACCCATCAGGGCCTGAGCTCGCCCGTCACAAAGAGCTTCAAC 
AGGGGAGAGTGTTAGAGGGAGAAGTGCCCCCACCTGCTCCTCAGTTCCAGCCTGACC 
CCCTCCCATCCTTTGGCCTCTGACCCTTTTTCCACAGGGGACCTACCCCTATTGCGG 
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TCCTCCAGCTCATCTTTCACCTCACCCCCCTCCTCCTCCTTGGCTTTAATTATGCTA 
ATGTTGGAGGAGAATGAATAAATAAAGTGAATCTTTGCACCTGTGGTTTCTCTCTTT 
CCTCAATTTAATAATTATTATCTGTTGTTTACCAACTACTCAATTTCTCTTATAAGG 
GACTAAATATGTAGTCATCCTAAGGCGCATAACCATTTATAAAAATCATCCTTCATT 
5 CTATTTTACCCTATCATCCTCTGCAAGACAGTCCTCCCTCAAACCCACAAGCCTTCT 
GTCCTCACAGTCCCCTGGGCCATGGATCCTCACATCCCAATCCGCGGCCGCAATTCG 
TAATCATGGTCATAGCTGTTTCCTGTGTGAAATTGTTATCCGCTCACAATTCCACAC 
AACATACGAGCCGGAAGCATAAAGTGTAAAGCCTGGGGTGCCTAATGAGTGAGCTAA 
CTCACATTAATTGCGTTGCGCTCACTGCCCGCTTTCCAGTCGGGAAACCTGTCGTGC 
10 CAGCTGCATTAATGAATCGGCCAACGCGCGGGGAGAGGCGGTTTGCGTATTGGGCGC 

0-52 8 [ Seq . I . D . No, 3 6] 

TTCTTCCTCCTCCTGGTGGCAGCTCCTAGATGGGTCCTGTCTC 

15 0-529 [Seq. I.D. No. 37] 

AGGTGCAGCTACAGCAGTGGGGCGCAGGACTGTTGAAGCCTTC 

O- 5 3 0 [ Seq . I.D. No . 38] 

GGAGACCCTGTCCCTCACCTGCGCTGTCTATGGTGGTTCCTTC 

20 

0-531 [Seq. I.D. No. 39] 

AGTGGTTACTACTGGAGCTGGATCCGCCAGCCACCAGGTAAGG 

0-532 [Seq. I.D. No. 40] 
25 GTCTGGAGTGGATTGGTGAAATCAATCATAGTGGAAGCACCAA 

0-543 [Seq. I.D. No. 41] 

TTCACCAATCCACTCCAGACCCTTACCTGGTGGCTGGCGGATC 

30 0-544 [Seq. I.D. No. 42] 

CAGCTCCAGTAGTAACCACTGAAGGAACCACCATAGACAGCGC 

0-545 [Seq. I.D. No. 43] 

AGGTGAGGGACAGGGTCTCCGAAGGCTTCAACAGTCCTGCGCC 

35 

0-54 6 [Seq. I.D. No. 44 ] 

CCACTGCTGTAGCTGCACCTGAGACAGGACCCATCTAGGAGCT 
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0-547 [Seq. I.D. No. 45] 

GCCACCAGGAGGAGGAAGAACCACAGGTGTTTCATGGTGGCAAGCTTG 

0-496 [Seq. I.D. No. 46] 
5 CATGAAACACCTGTGGTTCTTCC 

0-542 [Seq. I.D. No. 47] 

TCTTGAGAGACGGGTTGTAGTTGGTGCTTCCACTATGATTGAT 

10 0-533 [Seq. I.D. No. 48] 

CTACAACCCGTCTCTCAAGAGTCGAGTCACCATATCAGTAGAC 

0-534 [Seq. I.D. No. 49] 

ACGTCCAAGAACCAGTTCTCTCTGAAACTGAGCTCTGTGACCG 

15 

0-535 [Seq. I.D. No. 50] 

CTGCGGACACGGCTGTGTATTACTGTGCGAGAGTAATTAATTG 

0-536 [Seq. I.D. No. 51] 
2 0 GTTCGACCCTTGGGGCCAGGGAACCCTGGTCACCGTCTCCTCA 

0-537 [Seq. I.D. No. 52] 

GCCTCAACCAAGGGCCCATCGGTCTTCCCCCTGGCACC 

25 0-539 [Seq. I.D. No. 53] 

C CTGG C C C CAAGGGT CG AAC C AATT AATT AC T CTCG C AC AGT A 

O-540 [Seq. I.D. No. 54] 

ATACACAGCCGTGTCCGCAGCGGTCACAGAGCTCAGTTTCAGA 

30 

0-541 [Seq. I.D. No. 55] 

GAGAACTGGTTCTTGGACGTGTCTACTGATATGGTGACTCGAC 

0-538 [Seq. I.D. No. 56] 
35 CGATGGGCCCTTGGTTGAGGCTGAGGAGACGGTGACCAGGGTTC 

O-490 [Seq. I.D. No. 57] 
GAAGCACCAACTACAACCCG 
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O-520 [Seq. I.D. No. 58] 
GAGTTCCACGACACCGTCACC 

0-521 [Seq. I.D. No. 59] 
5 GACCTCAAGCTTGCCACCATGAAACACCTGTGG 

pHC6G5 [ Seq . I.D. No . 60] 

GAACTCGAGCAGCTGAAGCTTGCCACCATGAAACACCTGTGGTTCTTCCTCCTCCTG 
GTGGCAGCTCCTAGATGGGTCCTGTCTCAGGTGCAGCTACAGCAGTGGGGCGCAGGA 

10 CTGTTGAAGCCTTCGGAGACCCTGTCCCTCACCTGCGCTGTCTATGGTGGTTCCTTC 
AGTGGTTACTACTGGAGCTGGATCCGCCAGCCACCAGGTAAGGGTCTGGAGTGGATT 
GGTGAAATCAATCATAGTGGAAGCACCAACTACAACCCGTCTCTCAAGAGTCGAGTC 
ACCATATCAGTAGACACGTCCAAGAACCAGTTCTCTCTGAAACTGAGCTCTGTGACC 
GCTGCGGACACGGCTGTGTATTACTGTGCGAGAGTAATTAATTGGTTCGACCCTTGG 

15 GGCCAGGGAACCCTGGTCACCGTCTCCTCAGCCTCAACCAAGGGCCCATCGGTCTTC 
CCCCTGGCACCCTCCTCCAAGAGCACCTCTGGGGGCACAGCGGCCCTGGGCTGCCTG 
GTCAAGGACTACTTCCCCGAACCGGTGACGGTGTCGTGGAACTCAGGCGCCCTGACC 
AGCGGCGTGCACACCTTCCCGGCTGTCCTACAGTCCTCAGGACTCTACTCCCTCAGC 
AGCGTGGTGACCGTGCCCTCCAGCAGCTTGGGCACCCAGACCTACATCTGCAACGTG 

2 0 AATCACAAGCCCAGCAACACCAAGGTGGACAAGAAAGTTGGTGAGAGGCCAGCACAG 
GGAGGGAGGGTGTCTGCTGGAAGCCAGGCTCAGCGCTCCTGCCTGGACGCATCCCGG 
CTATGCAGCCCCAGTCCAGGGCAGCAAGGCAGGCCCCGTCTGCCTCTTCACCCGGAG 
GCCTCTGCCCGCCCCACTCATGCTCAGGGAGAGGGTCTTCTGGCTTTTTCCCCAGGC 
TCTGGGCAGGCACAGGCTAGGTGCCCCTAACCCAGGCCCTGCACACAAAGGGGCAGG 

2 5 TGCTGGGCTCAGACCTGCCAAGAGCCATATCCGGGAGGACCCTGCCCCTGACCTAAG 

CCCACCCCAAAGGCCAAACTCTCCACTCCCTCAGCTCGGACACCTTCTCTCCTCCCA 
GATTCCAGTAACTCCCAATCTTCTCTCTGCAGAGCCCAAATCTTGTGACAAAACTCA 
CACATGCCCACCGTGCCCAGGTAAGCCAGCCCAGGCCTCGCCCTCCAGCTCAAGGCG 
GGACAGGTGCCCTAGAGTAGCCTGCATCCAGGGACAGGCCCCAGCCGGGTGCTGACA 

3 0 CGTCCACCTCCATCTCTTCCTCAGCACCTGAACTCCTGGGGGGACCGTCAGTCTTCC 

TCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACAT 
GCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGG 
ACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCA 
CGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGG 
35 AGTACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCT 
CCAAAGCCAAAGGTGGGACCCGTGGGGTGCGAGGGCCACATGGACAGAGGCCGGCTC 
GGCCCACCCTCTGCCCTGAGAGTGACCGCTGTACCAACCTCTGTCCCTACAGGGCAG 
CCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAAC 



CAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAG 

TGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGAC 

TCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAG 

CAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACG 

CAGAAGAGCCTCTCCCTGTCTCCGGGTAAATGAGTGCGACGGCCGGCAAGCCCCCGC 

TCCCCGGGCTCTCGCGGTCGCACGAGGATGCTTGGCACGTACCCCCTGTACATACTT 

CCCGGGCGCCCAGCATGGAAATAAAGCACCCAGCGCTGCCCTGGGCCCCTGCGAGAC 

TGTGATGGTTCTTTCCACGGGTCAGGCCGAGTCTGAGGCCTGAGTGGCATGAGGGAG 

GCAGAGCGGGTCCCACTGTCCCCACACTGGCCCAGGCTGTGCAGGTGTGCCTGGGCC 

CCCTAGGGTGGGGCTCAGCCAGGGGCTGCCCTCGGCAGGGTGGGGGATTTGCCAGCG 

TGGCCCTCCCTCCAGCAGCACCTGCCCTGGGCTGGGCCACGGGAAGCCCTAGGAGCC 

CCTGGGGACAGACACACAGCCCCTGCCTCTGTAGGAGACTGTCCTGTTCTGTGAGCG 

CCCCTGTCCTCCCGACCTCCATGCCCACTCGGGGGCATGCCTGCAGGTCGACTCTAG 

AGGATCCCCGGGTACCGAGCTCGAATTCATCGATGATATCAGATCTGCCGGTCTCCC 

TATAGTGAGTCGTATTAATTTCGATAAGCCAGGTTAACCTGCATTAATGAATCGGCC 

AACGCGCGGGGAGAGGCGGTTTGCGTATTGGGCGCTCTTCCGCTTCCTCGCTCACTG 

ACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGGCGG 

TAATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACATGTGAGCAAAAG 

GCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGC 

TCCGCCCCCCTGACGAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACC 

CGACAGGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTC 

CTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCG 

TGGCGCTTTCTCAATGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCT 

CCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCG 

GTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTGGCAGCAG 

CCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTTGA 

AGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGC 

TGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCA 

CCGCTGGTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAG 

GATCTCAAGAAGATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAA 

ACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCC 

TTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGT 

CTGACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTCTATTTC 

GTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCT 

TACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAG 

ATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAA 

CTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTT 

CGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTACAGGCATCGTGGTGTCAC 
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GCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTA 
CATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTG 
TCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATT 
CTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCA 
5 AGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAATAC 
GGGATAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTT 
CTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAAC 
CCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGT 
GAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGAAAT 

10 GTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATT 
GTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTC 
CGCGCACATTTCCCCGAAAAGTGCCACCTGACGTCTAAGAAACCATTATTATCATGA 
CATTAACCTATAAAAATAGGCGTATCACGAGGCCCTTTCGTCTCGCGCGTTTCGGTG 
ATGACGGTGAAAACCTCTGACACATGCAGCTCCCGGAGACGGTCACAGCTTGTCTGT 

15 AAGCGGATGCCGGGAGCAGACAAGCCCGTCAGGGCGCGTCAGCGGGTGTTGGCGGGT 
GTCGGGGCTGGCTTAACTATGCGGCATCAGAGCAGATTGTACTGAGAGTGCACCATA 
TGGACATATTGTCGTTAGAACGCGGCTACAATTAATACATAACCTTATGTATCATAC 
ACATACGATTTAGGTGACACTATA 

20 HC6G5 [Seq. I.D. No. 61] 

AAGCTTGCCACCATGAAACACCTGTGGTTCTTCCTCCTCCTGGTGGCAGCTCCTAGA 
TGGGTCCTGTCTCAGGTGCAGCTACAGCAGTGGGGCGCAGGACTGTTGAAGCCTTCG 
GAGACCCTGTCCCTCACCTGCGCTGTCTATGGTGGTTCCTTCAGTGGTTACTACTGG 
AGCTGGATCCGCCAGCCACCAGGTAAGGGTCTGGAGTGGATTGGTGAAATCAATCAT 

25 AGTGGAAGCACCAACTACAACCCGTCTCTCAAGAGTCGAGTCACCATATCAGTAGAC 
ACGTCCAAGAACCAGTTCTCTCTGAAACTGAGCTCTGTGACCGCTGCGGACACGGCT 
GTGTATTACTGTGCGAGAGTAATTAATTGGTTCGACCCTTGGGGCCAGGGAACCCTG 
GTCACCGTCTCCTCAGCCTCAACCAAGGGCCCATCGGTCTTCCCCCTGGCACCCTCC 
TCCAAGAGCACCTCTGGGGGCACAGCGGCCCTGGGCTGCCTGGTCAAGGACTACTTC 

3 0 CCCGAACCGGT 

LC6G5 [Seq, I.D. No. 62] 

AAGCTTGCCACCATGATGGTCCCAGCTCAGCTCCTCGGTCTCCTGCTGCTCTGGTTC 
CCAGGTTCCAGATGCGACATCCAGATGACCCAGTCTCCATCTTCCGTGTCTGCATCT 
35 GTAGGAGACAGAGTCACCATCACTTGTCGGGCGAGTCAGGATATTAGCAGCTGGTTA 
GCCTGGTATCAGCATAAACCAGGTAAAGCACCTAAGCTCCTGATCTATGCTGCATCC 
AGTTTGCAAAGTGGTGTCCCATCAAGGTTCAGCGGAAGTGGATCTGGGACAGATTTC 
ACTCTCACCATCAGCAGCCTGCAGCCTGAAGATTTTGCAACTTACTATTGTCAACAG 



GCTAATAGTTTCCCGTACACTTTTGGTCAGGGAACCAAGCTGGAGATCAAACGAACT 
GTGGCTGCACCATCTGTCTTC 

Example 43. 

Binding of Human Anti-CD4 Monoclonal Antibodies to 
Non-Human Primate Lymphocytes. 
It is desirable to be able perform preclinical 
toxicology and pharmacokinetic studies of human anti-CD4 
monoclonal antibodies in animal models. It is further 
desirable for some purposes that the animal be a 
non-human primate that expresses CD4 comprising a 
cross-reactive epitope with human CD4 such that it is 
recognized by the monoclonal antibody. Three different 
non-human primate species, chimpanzee, rhesus, and 
cynomolgus monkeys, were tested for cross-reactive CD4 
epitopes with the 5 different human anti-CD4 monoclonal 
antibodies from hybridomas 1E11, 1G2 , 6G5, 10C5, and 4D1. 
Peripheral blood lymphocytes were isolated from whole 
blood of chimpanzee, rhesus, and cynomolgus monkeys. The 
isolated cells were double stained with human antibody 
from each of these 5 hybridomas (detected with 
FITC-anti-human IgG) and PE-anti-CD8 or PE-anti-CD4. The 
stained cells were then analyzed by flow cytometry to 
determine if each of the human monoclonal antibodies 
bound to endogenous CD4 on the surface of lymphocytes 
from each of these three non-human primates. Four of the 
five antibodies, 1E11, 6G5, IOCS, and 4D1, were found to 
bind to chimpanzee CD4 cells. Additionally, Four of the 
five antibodies, 6G5, 1G2 , IOCS, and 4D1, were found to 
bind to both rhesus and cynomolgus CD4 cells. Thus, 
three of five antibodies, 6G5, 10C5, and 4D1, bind to CD4 
cells in each of the three non-human primate species 
tested. 

Example 44. 

There are no known in vitro assays that can 
reliably predict whether a monoclonal antibody (mAb) will 
be nondepleting or immunosuppressive in patients. 
However, a correlation has been observed between the 



ability of three different mAbs to deplete (or not 
deplete) in humans and nonhuman primates such as 
chimpanzees and cynomolgus monkeys (See, e.g., M. Jonker 
et al., Clin. Exp. Immunol., 93:301-307 (1993); and J. A. 
Powelson et al . , Transplantation, 57:788-793 (1994)). 
Therefore a study was performed using human mAbs in 
nonhuman primates. 

Chimpanzees were used in this study, because 
one of the anti-CD4 mAbs, 1E11, recognizes CD-4 only in 
chimpanzees and not in Rhesus or cynomolgus monkeys. A 
second mAb, 6G5 f recognizes CD4 in chimpanzees, Rhesus 
and cynomolgus monkeys. A third mAb, 1G2 , does not 
recognize CD4 in chimpanzees, but does in Rhesus and 
cynomolgus monkeys. That mAb has already been shown to 
be nondepleting in vivo in cynomolgus monkeys. 

In addition to examining the effect of human 
mAbs on CD4+ T cell numbers in peripheral blood, the 
effect of the mAb administration on in vivo T cell 
function was also evaluated. The most accepted manner to 
do this is to use animals that have been presensitized to 
an antigen such as tuberculin or tetanus toxoid and who 
will mount a hypersensitivity reaction in the skin. 

Three male chimpanzees were enrolled in this 
study. Baseline whole blood samples were obtained on 
days -7, -3 and 1. After the blood draw on day 1, one 
chimpanzee each was intravenously infused with one of the 
two human mAbs (1E11 or 6G5) at 2 mg/kg. The third 
chimpanzee received an equal volume/kg of buffer only. 
Blood was drawn at 3 0 mins, 2 hrs, 8 hrs, 24 hrs and 48 
hrs post-infusion. On day 2, a skin reactivity test was 
performed. 

Results shown in Table 25 below clearly 
demonstrate that 1E11 caused transient depletion of 
peripheral lymphocytes, with most CD4+ T cells being 
depleted. Even though 6G5 did not cause lymphocyte or 
CD4+ T cell depletion, both mAbs were able to inhibit a 
hypersensitivity response to tetanus toxoid, compared to 
the control chimpanzee. Thus, both human mAbs appear to 
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be immunosuppressive in vivo, and this immunosuppression 
does not necessarily require T cell depletion. 

Table 2 5 . Effect of Human mAbs on Peripheral 
5 Chimpanzee Lymphocytes 

Peripheral Lymphocytes (million/ml) 





Study Day 


_L i_i X X 


6G5 


Control 




-7 




4.2 


6.4 


4.2 




-4 




4.0 


9.9 


4.4 


10 


1/ 


pre-infusion 


4.8 


5.7 


5.8 




1, 


3 0 min post 


1.6 


6.0 


4.0 




1/ 


2 hr post 


1.0 


6.7 


5.2 




If 


6 hr post 


1.5 


8 . 0 


4.2 




2 




3.5 


9.6 


5.7 


15 


3 




3.9 


9.7 


5.9 



The foregoing description of the preferred 
embodiments of the present invention has been presented 
for purposes of illustration and description. They are 
not intended to be exhaustive or to limit the invention 
20 to the precise form disclosed, and many modifications and 

variations are possible in light of the above teaching. 
It will be apparent that certain changes and 
modifications may be practiced within the scope of the 
claims. 

25 All publications and patent applications herein 

are incorporated by reference to the same extent as if 
each individual publication or patent application was 
specifically and individually indicated to be 
incorporated by reference. Commonly assigned 

30 applications U.S. S.N. 08/544,404 filed 10 October 1995, 

U.S. S.N. 08/352,322, filed 7 December 1994, U.S. S.N. 
08/209,741 filed 9 March 1994, U.S. S.N. 08/165,699 filed 
10 December 1993 and U.S. S.N. 08/161,739 filed 03 
December 1993, which is a continuation-in-part of 



08/155,301 filed 18 
07/810,279 filed 17 
18 March 1992, USSN 
07/990,860 filed 16 
5 08/053,131 filed 26 

herein by reference 
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November 1993, WO92/03 918, USSN 
December 1991, USSN 07/853,408 filed 
07/904,068 filed 23 June 1992, USSN 
December 1992, W093/ 12227, and USSN 
April 1993 are each incorporated 
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1 We claim: 

2 1. A method for producing a human sequence immunoglobulin 

3 polypeptide comprising the steps: 

4 (A) obtaining a transgenic mouse, wherein said 

5 transgenic mouse comprises a homozygous pair of 

6 functionally disrupted endogenous heavy chain 

7 alleles, a homozygous pair of functionally disrupted 

8 endogenous light chain alleles, at least one copy of 

9 a human immunoglobulin light chain transgene, and at 

10 least one copy of a human immunoglobulin heavy chain 

11 transgene; 

12 (B) immunizing the transgenic mouse with a predetermined 

13 antigen, wherein an immune response is induced and 

14 whereby a transgene of the mouse undergoes V-D-J 

15 rearrangement, whereby a rearranged transgene is 

16 produced; 

17 (C) isolating and sequencing a nucleic acid encoding at 

18 least a portion of a variable region of the 

19 rearranged transgene, and determining the amino acid 
2 0 sequence of an immunoglobulin polypeptide encoded by 

21 the sequenced portion of the rearranged transgene; 

22 (D) making an artificial gene that encodes a second 

23 immunoglobulin polypeptide, wherein the second 

24 immunoglobulin polypeptide comprises an amino acid 

25 sequence that is substantially similar to the amino 

2 6 acid sequence of the immunoglobulin polypeptide 

27 encoded by the sequenced portion of the rearranged 

28 transgene; 

29 (E) linking the artificial gene to a transcription 

3 0 promoter sequence; and 

31 (F) introducing the artificial gene into a cell; 

32 whereby a human sequence immunoglobulin polypeptide is 

33 produced. 

1 2. The method of claim 1 wherein the second immunoglobulin 

2 polypeptide comprises an amino acid sequence that is the 

3 same as the amino acid sequence of the immunoglobulin 

4 polypeptide encoded by the sequenced portion of the 

5 rearranged transgene . 

1 3* The method of claim 1 wherein the nucleic acid of step 

2 (c) is isolated from a hybridoma secreting a human 

3 sequence immunoglobulin. 

1 4. The method of claim 1 wherein the rearranged transgene is 

2 a human immunoglobulin heavy chain transgene. 

1 5. The method of claim 4 wherein the artificial gene encodes 

2 a gamma isotype constant region. 

1 6. The method of claim 4, further comprising introducing a 

2 second artificial gene into the cell of step (F) , wherein 

3 the second artificial gene is made according to steps A-E 

4 of claim 4 except that the rearranged transgene is a 
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5 human immunoglobulin light chain transgene, and wherein 

6 the cell produces an immunoglobulin. 

1 7. The method of claim 6 wherein the human immunoglobulin 

2 heavy chain transgene and the human immunoglobulin light 

3 chain transgene are expressed in a hybridoma produced by 

4 fusing a single B cell from a transgenic mouse to an 

5 immortal cell. 

1 8. The method of claim 6 wherein the cell produces an 

2 immunoglobulin that binds the predetermined antigen with 

3 an affinity constant (K a ) of at least about 10 8 M" 1 . 

1 9. An immunoglobulin produced by the method of claim 1. 

1 10 . A cell comprising at least one artificial gene encoding 

2 at least a portion of an immunoglobulin polypeptide, 

3 wherein the cell produces a detectable amount of an 

4 immunoglobulin that binds a predetermined human antigen, 

5 and wherein the immunoglobulin polypeptide has 

6 substantially the same sequence as an immunoglobulin 

7 polypeptide secreted by a hybridoma obtained from a 

8 transgenic mouse, said mouse comprising a homozygous pair 

9 of functionally disrupted endogenous heavy chain alleles, 

10 a homozygous pair of functionally disrupted endogenous 

11 light chain alleles, at least one copy of a human 

12 immunoglobulin light chain transgene, and at least one 

13 copy of a human immunoglobulin heavy chain transgene. 

1 11. A eukaryotic cell of claim 10. 

1 12. The cell of claim 11 wherein the immunoglobulin binds 

2 human CD4 or an antigenic fragment thereof. 

1 13. An immunoglobulin that specifically binds human CD4, 

2 wherein said immunoglobulin comprises a human sequence 

3 light chain variable region comprising an amino acid 

4 sequence substantially identical to an amino acid 

5 sequence encoded by Seq. I.D. No. 1, Seq. I.D. No. 2, 

6 Seq. I.D. No. 3, Seq. I.D. No. 4, Seq. I.D. No. 5, Seq. 

7 I.D. No. 6, Seq. I.D. No. 7, Seq. I.D. No. 8, Seq. I.D. 

8 No. 9, or Seq. I.D., No. 10. 

1 14. The immunoglobulin of claim 13 wherein the immunoglobulin 

2 comprises the sequence of Seq. I.D. No. 61 or Seq. I.D. 

3 No. 62. 

1 15. A method for selecting a hybridoma secreting a human 

2 sequence immunoglobulin that specifically bind a 

3 preselected antigen comprising the steps: 

4 a) obtaining B cells from a trangenic mouse, wherein 

5 the transgenic mouse comprises a homozygous pair of 

6 functionally disrupted endogenous heavy chain 

7 alleles, a homozygous pair of functionally disrupted 

8 endogenous light chain alleles, at least one copy of 



a human immunoglobulin light chain transgene, and at 
least one copy of a human immunoglobulin heavy chain 
transgene comprising sequences encoding and non-jz 
segments, and wherein the mouse has been immunized 
with the predetermined antigen; 

b) fusing the B cells to immortal cells wherein 
hybridomas are produced; 

c) identifying a first group of hybridomas that secrete 
non-ju, non-5 isotype immunoglobulins; 

d) identifying a second group of hybridomas, wherein 
the second group is a subset of the first group of 
hybridomas, and wherein hybridomas in the second 
secrete immunoglobulins that specifically bind the 
predetermined antigen; 

e) selecting from the second group of hybridomas a 
hybridoma secreting a human sequence immunoglobulin 
that specifically binds to the preselected antigen. 



The method of claim 15 wherein the first group of 
hybridomas secrete IgG immunoglobulins. 

The method of claim 15 wherein the human sequence 
immunoglobulin of step (e) binds the predetermined 
antigen with an affinity constant (K a ) of at least about 
10 9 M" 1 . 

A human anti-CD4 immunoglobulin that specifically binds 
CD4 from humans and specifically binds CD4 from at least 
one non-human primate. 

The immunoglobulin of claim 18 wherein the non-human 
primate is Rhesus monkey, cynomolgus monkey, or 
chimpanzee. 

The immunoglobulin of claim 19 wherein the human anti-CD4 
immunoglobulin specifically binds CD4 from both rhesus 
monkey and cynomolgus monkey. 

The immunoglobulin of claim 19 wherein the human anti-CD4 
immunoglobulin specifically binds CD4 from rhesus monkey, 
cynomolgus monkey and chimpanzee. 

A human sequence immunoglobulin comprising a VH4-34 
segment, a DXP'l segment, a JH4 segment, and a heavy 
chain CDR3 region comprising the sequence DITMVRGPH [Seq. 
I. D. No. 63]. 

A human sequence immunoglobulin comprising a VH5-51 
segment, a DHQ52 segment, a JH2 segment, and a heavy 
chain CDR3 region comprising the sequence PANWNWYFVL 
[Seq. I.D. No. 64] . 

A human sequence immunoglobulin comprising a VH4-34 
segment, a JH5 segment, and a heavy chain CDR3 region 
comprising the sequence VINWFDP [Seq. I.D. No. 65]. 
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1 25. A human sequence immunoglobulin comprising a VH5-51 

2 segment, a DHQ52 segment, a JH4 segment , and a heavy 

3 chain CDR3 region comprising the sequence DQLGLFDY [Seq. 

4 I.D. No. 66]. 

1 26. A human sequence immunoglobulin comprising a VkA27/All 

2 segment, a Jk4 segment, and a light chain CDR3 region 

3 comprising the sequence QQYGSSPLT [Seq. I.D. No. 67]. 

1 27. A human sequence immunoglobulin comprising a VkL18 

2 segment, a Jk4 segment, and a light chain CDR3 region 

3 comprising the sequence QQFISYPQLT [Seq. I.D. No. 68]. 

1 28. A human sequence immunoglobulin comprising a VkL19 

2 segment, a Jk2 segment, and a light chain CDR3 region 

3 comprising the sequence QQANSFPYT [Seq. I.D. No. 69]. 

gl 29. A human sequence immunoglobulin comprising a VkL15 

m 2 segment, a Jk2 segment, and a light chain CDR3 region 

3 comprising the sequence QQYDSYPYT [Seq. I.D. No. 70]. 

1 30. A hybridoma secreting an immunoglobulin, wherein the 

%2 immunoglobulin is selected from the group consisting of: 

:j! 3 a human sequence immunoglobulin comprising a VH4-34 

f] 4 segment, a DXP'l segment, a JH4 segment, and a heavy 

y * 5 chain CDR3 region comprising the sequence DITMVRGPH [Seq. 

;\ 6 I.D. No. 63], a human sequence immunoglobulin comprising 

K ~~2 7 a VH5-51 segment, a DHQ52 segment, a JH2 segment, and a 

S 8 heavy chain CDR3 region comprising the sequence 

H 9 PANWNWYFVL [Seq. I.D. No. 64], a human sequence 

OL0 immunoglobulin comprising a VH4-34 segment, a JH5 

=£iLl segment, and a heavy chain CDR3 region comprising the 

yl2 sequence VINWFDP [Seq. I.D. No. 65], a human sequence 

13 immunoglobulin comprising a VH5-51 segment, a DHQ52 

14 segment, a JH4 segment , and a heavy chain CDR3 region 

15 comprising the sequence DQLGLFDY [Seq. I.D. No. 66], a 

16 human sequence immunoglobulin comprising a VkA27/All 

17 segment, a Jk4 segment, and a light chain CDR3 region 

18 comprising the sequence QQYGSSPLT [Seq. I.D. No. 67], a 

19 human sequence immunoglobulin comprising a VkL18 segment, 

20 a Jk4 segment, and a light chain CDR3 region comprising 

21 the sequence QQFISYPQLT [Seq. I.D. No. 68], a human 

22 sequence immunoglobulin comprising a VkL19 segment, a Jk2 

23 segment, and a light chain CDR3 region comprising the 

24 sequence QQANSFPYT [Seq. I.D. No. 69], and a human 

25 sequence immunoglobulin comprising a VkL15 segment, a Jk2 

26 segment, and a light chain CDR3 region comprising the 

27 sequence QQYDSYPYT [Seq. I.D. No. 70]. 
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(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/853,408 

(B) FILING DATE: 18-MAR-1992 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/810,279 

(B) FILING DATE: 17-DEC-1991 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/575,962 

(B) FILING DATE: 31-AUG-1990 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/574,748 

(B) FILING DATE: 29-AUG-1990 

fv-i i> PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: WO PCT/US91/06185 

(B) FILING DATE: 29-AUG-1991 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Apple, Randolph T. 

(B) REGISTRATION NUMBER: 36,42 9 

(C) REFERENCE /DOCKET NUMBER: 014643-00902 0 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (415) 576-0200 

(B) TELEFAX: (415) 576-0300 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
CTAADTGGGG 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Asp Ala Phe Asp He 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GAGCTGAGCT GAGCTGAGCT GGGGT 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GAGCTGAGCT GAGCTGAGCT GAGCTGGGGT 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 13 : 
GAGCTGAGCT GAGCTGAGCT GAGCTGAGCT GGGGT 



(2) INFORMATION FOR SEQ ID NO : 14 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 
GAGCTGAGCT GAGCTGAGCT GAGCTGAGCT GAGCTGGGGT 
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(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GAGCTGAGCT GAGCTGAGCT GAGCTGAGCT GAGCTGAGCT GGGGT 



(2) INFORMATION FOR SEQ ID NO : 16 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 16 : 
GAGCTGAGCT GAGCTGAGCT GAGCTGAGCT GAGCTGAGCT GAGCTGGGGT 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 
GAGCTGAGCT GAGCTGAGCT GAGCTGAGCT GAGCTGAGCT GAGCTGAGCT GGGGT 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 
GAGCTGAGCT GAGCTGAGCT GAGCTGAGCT GAGCTGAGCT GAGCTGAGCT GAGCTGGGGT 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GAGCTGAGCT GAGCTGAGCT GAGCTGAGCT GAGCTGAGCT GAGCTGAGCT GAGCTGAGCT 
GAGCTGAGCT GAGCTGGGGT 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
GAGCTGAGCT GAGCTGAGCT GAGCTGAGCT GAGCTGAGCT GAGCTGAGCT GAGCTGAGCT 
GAGCTGAGCT GAGCTGAGCT GGGGT 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
GAGCTGAGCT GAGCTGAGCT GAGCTGAGCT GAGCTGAGCT GAGCTGAGCT GAGCTGAGCT 
GAGCTGAGCT GAGCTGAGCT GAGCTGGGGT 



(2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
AATTGCGGCC GC 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
CAGATCTGCA TGCTCGAGCT CTAGATATCG ATGCTAGCGC GCCATGGATC C 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
AGGCCATTGC GGCCGCAGTA TGCAAAAAAA AGCCCGCTCA TTAGGCGGGC T 



(2) INFORMATION FOR SEQ ID NO:32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 
CGCGTGGCCG CAATGGCCA 



(2) INFORMATION FOR SEQ ID NO:33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 
CTAGTGGCCA TTGCGGCCA 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 
CAGGATC CAG AT AT C AGT AC CTGAAACAGG GCTTGC 
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(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
TGAGCCACGA AGACCCTGAG GTCAAGTTCA ACTGGTACGT GG 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 
TGGTATTACT ATGGTTCGGG GAGTTATTAT AACCACAGTG TC 



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
GCCTGAAATG GAGCCTCAGG GCACAGTGGG CACGGACACT GT 



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 
GCAGGGAGGA CATGTTTAGG ATCTGAGGCC GCACCTGACA CC 
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(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: 
AGCTTAGTTC GA 12 



(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 
GAATGGGAGT GAGGCTCTCT CATACCCTAT TCAGAACTGA CT 42 



(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
GAACTGTGGC TGCACCATCT GTCTTCATCT TCCCGCCATC TG 42 



(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:50: 



GAGGTACACT GACATACTGG CATGCCCCCC CCCCCC 
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(2) INFORMATION FOR SEQ ID NO: 51: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
GTACGCCATA TCAGCTGGAT GAAGTCAT C A GATGGCGGGA AGATGAAGAC AGATGGTGCA 



(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: 
TCATCAGATG GCGGGAAGAT GAAGACAGAT GGTGCA 



(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
GTACGCCATA TCAGCTGGAT GAAG 



(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 54 : 
GAGGTACACT GACATACTGG CATG 



(2) INFORMATION FOR SEQ ID NO: 55: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
GTACGCCATA TCAGCTGGAT GAAGACAGGA GACGAGGGGG AAAAGGGTTG GGGCGGATGC 



(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:56: 
ACAGGAGACG AGGGGGAAAA GGGTTGGGGC GGATGC 



(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 
GTACTCCATA TCAGCTGGAT GAAG 



(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:58: 
GGCTGATGCT GCACCAACTG TAT C CAT C TT CCCACCATCC AG 
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(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
GCGGCCGCTG TCGACAAGCT TCGAATTCAG ATCGATGTGG TACCTGGATC CTCGAGTGCG 6 0 

GCCGC 6 5 



(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{ D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
GGCCGCAAGC TTACTGCTGG ATCCTTAATT AAT CGATAGT GATCTCGAGG C 51 



(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE : DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 
GGCCGCCTCG AGATCACTAT CGATTAATTA AGGATC CAGC AGTAAGCTTG C 51 



(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
CTCCAGGATC CAGATATCAG TACCTGAAAC AGGGCTTGC 



39 
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(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 
CTCGAGCATG CACAGGACCT GGAGCACACA CAGCCTTCC 3 9 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3618 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME / KEY : - 

(B) LOCATION: 1..3618 

(D) OTHER INFORMATION: /note= "vector pGPe" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

AATTAGCGGC CGCCTCGAGA TCACTATCGA TTAATTAAGG ATCCAGATAT CAGTACCTGA 6 0 

AACAGGGCTG CTCACAACAT CTCTCTCTCT GTCTCTCTGT CTCTGTGTGT GTGTCTCTCT 12 0 

CTGTCTCTGT CTCTCTCTGT CTCTCTGTCT CTGTGTGTGT CTCTCTCTGT CTCTCTCTCT 180 

GTCTCTCTGT CTCTCTGTCT GTCTCTGTCT CTGTCTCTGT CTCTCTCTCT CTCTCTCTCT 24 0 

CTCTCTCTCT CTCTCTCACA CACACACACA CACACACACA CACACCTGCC GAGTGACTCA 3 00 

CTCTGTGCAG GGTTGGCCCT CGGGGCACAT GCAAATGGAT GTTTGTTCCA TGCAGAAAAA 360 

CATGTTTCTC ATTCTCTGAG C CAAAAAT AG CATCAATGAT TCCCCCACCC TGCAGCTGCA 42 0 

GGTTCACCCC ACCTGGCCAG GTTGACCAGC TTTGGGGATG GGGC TGGGGG TTCCATGACC 4 80 

CCTAACGGTG ACATTGAATT CAGTGTTTTC CCATTTATCG AC AC TGCTGG AATCTGACCC 54 0 

TAGGAGGGAA TGACAGGAGA TAGGCAAGGT CCAAACACCC CAGGGAAGTG GGAGAGACAG 6 00 

GAAGGCTGTG TGTGCTCCAG GTCCTGTGCA TGCTGCAGAT CTGAATTCCC GGGTACCAAG 66 0 

CTTGCGGCCG CAGTATGCAA AAAAAAGCCC GCT CATTAGG CGGGCTCTTG GCAGAACATA 72 0 

TCCATCGCGT CCGCCATCTC CAGCAGCCGC ACGCGGCGCA TCTCGGGCAG CGTTGGGTCC 78 0 

TGGCCACGGG TGCGCATGAT CGTGCTCCTG TCGTTGAGGA CCCGGCTAGG CTGGCGGGGT 840 

TGCCTTACTG GTTAGCAGAA TGAATCACCG ATACGCGAGC GAACGTGAAG CGACTGCTGC 90 0 

TGCAAAACGT CTGCGACCTG AGCAACAACA TGAATGGTCT TCGGTTTCCG TGTTTCGTAA 96 0 
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TCAGAAGTAA 


GTTGGCCGCA 


GTGTTATCAC 


TCATGGTTAT 


GGCAGCACTG 


CATAATTCTC 


3060 


TTACTGTCAT 


GCCATCCGTA 


AGATGCTTTT 


CTGTGACTGG 


TGAGTACTCA 


ACCAAGTCAT 


3120 


TCTGAGAATA 


GTGTATGCGG 


CGACCGAGTT 


GCTCTTGCCC 


GGCGTCAACA 


CGGGATAATA 


3180 


CCGCGCCACA 


TAGCAGAACT 


TTAAAAGTGC 


TCATCATTGG 


AAAACGTTCT 


TCGGGGCGAA 


3240 


AACTCTCAAG 


GATCTTACCG 


CTGTTGAGAT 


CCAGTTCGAT 


GTAACGCACT 


CGTGCACCCA 


3300 


ACTGATCTTC 


AGCATCTTTT 


ACTTTCACGA 


GCGTTTCTGG 


GTGAGCAAAA 


ACAGGAAGGC 


3360 


AAAATGCCGC 


AAAAAAGGGA 


ATAAGGGCGA 


CACGGAAATG 


TTGAATACTC 


ATACTCTTCC 


3420 


TTTTTCAATA 


TTATTGAAGC 


ATTTATCAGG 


GTTATTGTCT 


GATGAGCGGA 


TACATATTTG 


3480 


AATGTATTTA 


GAAAAATAAA 


CAAATAGGGG 


TTCCGCGCAC 


ATTTCCCCGA 


AAAGTGC CAC 


3540 


CTGACGTCTA 


AGAAACCATT 


ATTATCATGA 


CATTAACCTA 


TAAAAATAGG 


CGTATCACGA 


3600 


GGCCCTTTCG 


TCTTCAAG 
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(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 
GGACTGTGTC CCTGTGTGAT GCTTTTGATG TCTGGGGCCA AG 



(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 74 : 
CACCAAGTTG ACCTGCCTGG TCACAGACCT GACCACCTAT GA 



(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 
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(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: j oin (241 . . 286 , 373.. 677) 

(D) OTHER INFORMATION: /note= "human V-HI family gene V-H49.8" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

TTCCTCAGGC AGGATTTAGG GCTTGGTCTC TCAGCATCCC ACACTTGTAC AGCTGATGTG 60 

GCATCTGTGT TTTCTTTCTC ATCCTAGATC AAGCTTTGAG CTGTGAAATA CCCTGCCTCA 12 0 

TGAATATGCA AATAATCTGA GGTCTTCTGA GATAAATATA GATATATTGG TGCCCTGAGA 18 0 

GCATCACATA ACAACCAGAT TCCTCCTCTA AAGAAGC CC C TGGGAGCAC A GCTCATCACC 24 0 

ATG GAC TGG ACC TGG AGG TTC CTC TTT GTG GTG GCA GCA GCT ACA G 286 
Met Asp Trp Thr Trp Arg Phe Leu Phe Val Val Ala Ala Ala Thr 
15 10 15 

GTAAGGGGCT TCCTAGTCCT AAGGCTGAGG AAGGGATCCT GGTTTAGTTA AAGAGGATTT 346 

TATTCACCCC TGTGTCCTCT CCACAG GT GTC CAG TCC CAG GTC CAG CTG GTG 3 98 

Gly Val Gin Ser Gin Val Gin Leu Val 
20 

CAG TCT GGG GCT GAG GTG AAG AAG CCT GGG TCC TCG GTG AAG GTC TCC 446 
Gin Ser Gly Ala Glu Val Lys Lys Pro Gly Ser Ser Val Lys Val Ser 
25 30 35 40 

TGC AAG GCT TCT GGA GGC ACC TTC AGC AGC TAT GCT ATC AGC TGG GTG 494 
Cys Lys Ala Ser Gly Gly Thr Phe Ser Ser Tyr Ala He Ser Trp Val 
45 50 55 

CGA CAG GCC CCT GGA CAA GGG CTT GAG TGG ATG GGA AGG ATC ATC CCT 542 
Arg Gin Ala Pro Gly Gin Gly Leu Glu Trp Met Gly Arg He He Pro 
60 65 70 

ATC CTT GGT ATA GCA AAC TAC GCA CAG AAG TTC CAG GGC AGA GTC ACG 59 0 

He Leu Gly He Ala Asn Tyr Ala Gin Lys Phe Gin Gly Arg Val Thr 
75 80 85 

ATT ACC GCG GAC AAA TCC ACG AGC ACA GCC TAC ATG GAG CTG AGC AGC 63 8 

He Thr Ala Asp Lys Ser Thr Ser Thr Ala Tyr Met Glu Leu Ser Ser 
90 95 100 

CTG AGA TCT GAG GAC ACG GCC GTG TAT TAC TGT GCG AGA GACACAGTGT 68 7 

Leu Arg Ser Glu Asp Thr Ala Val Tyr Tyr Cys Ala Arg 
105 HO 115 

GAAAAC C C AC ATC CTGAGAG TGTCAGAAAC CCTGAGGGAG AAGGCAGCTG TGCCGGGCTG . 747 
AGGAGATGAC AGGGTTTATT AGGTTTAAGG CTGTTTACAA AATGGGTTAT ATATTTGAGA 807 
AAAAA 

(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 



812 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

Met Asp Trp Thr Trp Arg Phe Leu Phe Val Val Ala Ala Ala Thr Gly 
15 10 15 

Val Gin Ser Gin Val Gin Leu Val Gin Ser Gly Ala Glu Val Lys Lys 
20 25 30 

Pro Gly Ser Ser Val Lys Val Ser Cys Lys Ala Ser Gly Gly Thr Phe 
35 40 45 

Ser Ser Tyr Ala lie Ser Trp Val Arg Gin Ala Pro Gly Gin Gly Leu 
50 55 60 

Glu Trp Met Gly Arg lie lie Pro lie Leu Gly lie Ala Asn Tyr Ala 
65 70 75 80 

Gin Lys Phe Gin Gly Arg Val Thr lie Thr Ala Asp Lys Ser Thr Ser 
85 90 95 

Thr Ala Tyr Met Glu Leu Ser Ser Leu Arg Ser Glu Asp Thr Ala Val 
100 105 110 

Tyr Tyr Cys Ala Arg 
115 



(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 
CCGGTCGACC GG 



(2) INFORMATION FOR SEQ ID NO:82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 82 : 
CTAGCTCGAG TCCAAGGAGT CTGTGCCGAG GTGCAGCTGN 



(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 
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(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:83: 
GTTGCTCGAG TGAAAGGTGT CCAGTGTGAG GTGCAGCTGN 4 0 



(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 
GGCGCTCGAG TTCCACGACA CCGTCACCGG TTC 3 3 



(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 
CCTGCTCGAG GCAGCCAACG GCCACGCTGC TCG 3 3 



(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 8 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 
TACTGTGCGA GACGGCTAAA CTGGGGTTGA TGCTTTTGAT ATCTGGGGCC AAGGGACAAT 6 0 

GGTCACCGTC TCTTCAGCCT CCACCAAG 88 



(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 
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(2) INFORMATION FOR SEQ ID NO:63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:63: 
AATTAGCGGC CGCAGTATGC AAAAAAAAGC CCGCTCATTA GGCGGGCT 48 



(2) INFORMATION FOR SEQ ID NO:64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:64: 
GCGGCCGCCT CGAGATCACT ATCGATTAAT TAAGGATCCA GCAGTAAGCT TGCGGCCGC 59 



(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
GCGGCCGCAT CCCGGGTCTC GAGGTCGACA AGCTTTCGAG GATCCGCGGC CGC 53 



(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
GCGGCCGCTG TCGACAAGCT TATCGATGGA TCCTCGAGTG CGGCCGC 



47 
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AGTCTGGAAA CGCGGAAGTC AGCGCCCTGC ACCATTATGT TCCGGATCTG CATCGCAGGA 102 0 

TGCTGCTGGC TACCCTGTGG AACACCTACA TCTGTATTAA CGAAGCGCTG GCATTGACCC 10 80 

TGAGTGATTT TTCTCTGGTC CCGCCGCATC CATACCGCCA GTTGTTTACC CTCACAACGT 114 0 

TCCAGTAACC GGGCATGTTC ATCATCAGTA ACCCGTATCG TCACGATCCT CTCTCGTTTC 12 0 0 

ATCGGTATCA TTACCCCCAT GAACAGAAAT TCCCCCTTAC ACGGAGGCAT CAAGTGACCA 12 6 0 

AACAGGAAAA AACCGCCCTT AACATGGCCC GCTTTATCAG AAGCCAGACA TTAACGCTTC 13 2 0 

TGGAGAAACT CAACGAGCTG GACGCGGATG AACAGGCAGA CAT CTGTGAA TCGCTTCACG 13 8 0 

ACCACGCTGA TGAGCTTTAC CGCAGCTGCC TCGCGCGTTT CGGTGATGAC GGTGAAAACC 144 0 

TCTGACACAT GCAGCTCCCG GAGACGGTCA CAGCTTGTCT GTAAGCGGAT GCCGGGAGCA 15 0 0 

GACAAGCCCG TCAGGGCGCG TCAGCGGGTG TTGGCGGGTG TGGGGGCGCA GCCATGACCC 156 0 

AGTCACGTAG CGATAGCGGA GTGTATACTG GCTTAACTAT GCGGCATCAG AGCAGATTGT 162 0 

ACTGAGAGTG CACCATATGC GGTGTGAAAT ACCGCACAGA TGCGTAAGGA GAAAATACCG 168 0 

CATCAGGCGC TCTTCCGCTT CCTCGCTCAC TGACTCGCTG CGCTCGGTCG TTCGGCTGCG 174 0 

GCGAGCGGTA TCAGCTCACT CAAAGGCGGT AATACGGTTA TCCACAGAAT CAGGGGATAA 18 0 0 

CGCAGGAAAG AACATGTGAG CAAAAGGCCA GCAAAAGGCC AGGAAC CGTA AAAAGGCCGC 186 0 

GTTGCTGGCG TTTTTCCATA GGCTCCGCCC CCCTGACGAG CATCACAAAA ATCGACGCTC 192 0 

AAGTCAGAGG TGGCGAAACC CGACAGGACT ATAAAGATAC CAGGCGTTTC CCCCTGGAAG 198 0 

CTCCCTCGTG CGCTCTCCTT AGGTATCTCA GTTCGGTGTA GGTCGTTCGC TCCAAGCTGG 2 04 0 

GCTGTGTGCA CGAACCCCCC GTTCAGCCCG ACCGCTGCGC CTTATCCGGT AACTATCGTC 2100 

TTGAGTC CAA CCCGGTAAGA CACGACTTAT CGCCACTGGC AGCAGC CACT GGTAACAGGA 216 0 

TTAGCAGAGC GAGGTATGTA GGCGGTGGTA CAGAGTTCTT GAAGTGGTGG CCTAACTACG 222 0 

GCTACACTAG AAGGACAGTA TTTGGTATCT GCGCTCTGCT GAAGC CAGTT ACCTTCGGAA 228 0 

AAAGAGTTGG TAGCTCTTGA TCCGGCAAAC AAACCACCGC TGGTAGCGGT GGTTTTTTTG 2 34 0 

TTTGCAAGCA GCAGATTACG CGCAGAAAAA AAGGATCTCA AGAAGATCCT TTGATCTTTT 24 0 0 

CTACGGGGTC TGACGCTCAG TGGAACGAAA ACTCACGTTA AGGGATTTTG GTCATGAGAT 246 0 

TATCAAAAAG GATCTTCACC TAGATCCTTT TAAATTAAAA ATGAAGTTTT AAATCAATCT 2 52 0 

AAAGTATATA TGAGTAAACT TGGTCTGACA GTTACCAATG CTTAATGAGT GAGGCAGGTA 2580 

TCTCAGCGAT CTGTCTATTT CGTTCATCCA TAGTTGCCTG ACTCCCCGTC GTGTAGATAA 2 64 0 

CTACGATACG GGAGGGCTTA CCATCTGGCC CCAGTGCTGC AATGATACCG CGAGACCCAC 2 70 0 

GCTCACCGGC TCCAGATTTA TCAGCAATAA ACCAGCCAGC CGGAAGGGCC GAGCGCAGAA 2 76 0 

GTGGTCCTGC AACTTTATCC GCCTCCATCC AGTCTATTAA TTGTTGCCGG GAAGCTAGAG 2 82 0 

TAAGTAGTTC GCCAGTTAAT AGTTTGCGCA ACGTTGTTGC CATTGCTGCA GGCATCGTGG 28 8 0 

TGTCACGCTC GTCGTTTGGT ATGGCTTCAT TCAGCTCCGG TTCCCAACGA TCAAGGCGAG 2 940 

TTACATGATC CCCCATGTTG TGCAAAAAAG CGGTTAGCTC CTTCGGTCCT CCGATCGTTG 3 0 00 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 
TACTGTGCGA GACACCGTAT AGCAGCAGCT GGCTTTGACT ACTGGGGCCA GGGAACCCTG 6 0 

GTCACCGTCT CCTCAGCCTC CACCAAG 8 7 



(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 
TACTGTGCGA GATATTACTA CTACTACTAC GGTATGGACG TCTGGGGCCA AGGGAC CACG 6 0 

GTCACCGTCT CCTCAGCCTC CACCAAG 87 



(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 
TACTGTGCGA GACATTACGA TATTTTGACT GGTCCTACTA C T AC T AC GG T ATGGACGTCT 
GGGGCCAAGG GACCACGGTC ACCGTCTCCT CAGCCTCCAC CAAG 



(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



60 
104 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:90: 
TACTGTGCGA GACGGAGGTA CTATGGTTCG GGGAGTTATT ATAACGTCTT TGACTACTGG 
GGCCAGGGAA CCTGGTCACC GTCTCCTCAG CCTCCACCAA G 



60 
101 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 
TACTGTGCGA GATACTTCCA GCACTGGGGC CAGGGCACCC TGGTCACCGT CTCCTCAGGG 
AGTGCGTCC 



(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 
TACTGTGCGA GACACGTAGC TAACTCTTTT GACTACTGGG GCCAGGGAAC CCTGGTCACC 
GTCTCCTCAG GGAGTGCATC C 



(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:96: 
TACTGTGCGA GACAAATTAC TATGGTTCGG GGAGTTCCCT TTGACTACTG GGGCCAGGGA 
ACCCTGGTCA CCGTCTCCTC AGGGAGTGCA TCC 



(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

( i i ) MOLECULE TYPE : DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:97: 
TACTGTGCGA GACAATACTT CCAGCACTGG GGCCAGGGCA CCCTGGTCAC CGTCTCCTCA 



GGGAGTGCAT CC 
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(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 
TACTGTGCGA GACGGGGGGT GTCTGATGCT TTTGATATCT GGGGCCAAGG GACAATGGTC 
ACCGTCTCTT CAGCCTCCAC CAAG 



(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 92 : 
TACTGTGCGA GAGCAACTGG CGCTTTTGAT ATCTGGGGCC AAGGGACAAT GGTCACCGTC 
TCTTCAGGGA GTGCATCC 



(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 99 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 
TACTGTGCGA GATCGGCTAA CTGGGGATCC TACTACTACT ACGGTATGGA CGTCTGGGGC 
C AAGGGAC C A CGGTCACCGT CTCCTCAGGG AGTGCATCC 



(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 9 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 
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(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO : 98 : 

TACTGTGCGA GACAAACTGG GGACTACTAC TACTACGGTA TGGACGTCTG GGGCCAAGGG 6 0 

ACCACGGTCA CCGTCTCCTC AGGGAGTGCA TCC 93 

(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
TACTGTGCGA GACATTACTA TGGTTCGGGG AGTTATGACT ACTACTACTA CGGTATGGAC 6 0 

GTCTGGGGCC AAGGGAC C AG GTCACCGTCT CCTCAGGGAG TGCATCC 107 

(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 
TACTGTGCGA GACAGGGAGT GGGGCCAGGG AACCCTGGTC ACCGTCTCCT CAGCCTCCAC 60 
CAAG 



64 



(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 



(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 
TACTGTGCGA GATTCTGGGA GACTGGTTCG ACCCCTGGGG CCAGGGAACC CTGGTCACCG 
TCTCCTCAGG GAGTGCATCC 



(2) INFORMATION FOR SEQ ID NO:102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 
TACTGTGCGA GACGGAGGTA CTATGGTTCG GGGAGTTATT ATAACGTCTT TGACTACTGG 
GGC CAGGGAA CCCTGGTCAC CGTCTCCTCA GCCTCCACCA AG 



(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
TACTGTGCGA GACAAACCTG GGGAGGAGAC TACTGGGGCC AGGGAACCCT GGTCACCGTC 
TCCTCAGCCT CCACCAAG 



(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 
TACTGTGCGA GAGGATATAG TGGCTACGAT AACTACTACT ACGGTATGGA CGTCTGGGGC 
CAAGGGACCA CGGTCACCGT CTCCTCAGCC TCCACCAAG 
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(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 
TACTGTGCGA GACAAACTGG GGAGGACTAC TTTGACTACT GGGGCCAGGG AACCCTGGTC 60 
ACCGTCTCCT CAGGGAGTGC ATCC 84 



(2) INFORMATION FOR SEQ ID NO:106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 
TACTGRGCGA GATATAGTGG C T AC GAT T AC CTACTGGTAC TTCGATCTCT GGGGCCGTGG 6 0 

CACCCTGGTC ACCGTCTCCT CAGCCTCCAC CAAG 94 



(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:107: 
TACTGRGCGA GAGCATCCCT CCCCTCCTTT GACTACTACG GTATGGACGT CTGGGGCCAA 6 0 

GGGACCACGG TCACCGTCTC CTCAGCCTCC AC CAAG 96 



(2) INFORMATION FOR SEQ ID NO:108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:108: 
TACTGTGCGA GACGGGGTGG GGTTTGACTA CTGGGGCCAG GGAACCCTGG TCACCGTCTC 6 0 

CTCAGGGAGT GCATCC 76 

(2) INFORMATION FOR SEQ ID NO: 10 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 
TACTGTGTGC CGGTCGAAAC TTTACTACTA CTACTACGGT ATGGACGTCT GGGGCCAAGG 6 0 

GACCACGGTC ACCGTCTCCT CAGGGAGTGC ATCC 94 

(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 
TACTGTGCGA GAGATATTTT GACTGGTTAA CGTGACTACT GGGGCCAGGG AACCCTGGTC 6 0 

ACCGTCTCCT CAGGGAGTGC ATCC 84 

(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 base pairs 

(B) TYPE: nucleic acid 

( C ) S TRANDEDNES S : s ingl e 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 
TACTGTGCGA GACATGGTAT AGCAGCAGCT GGTACTGCTT TTGATATCTG GGGC CAAGGG 
ACAATGGTCA CCGTCTCTTC AGGGAGTGCA TCC 



60 
93 



I 
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(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 
TACTGTGTGA GATCAACTGG GGTTGATGCT TTTGATATCT GGGGCCAAGG GACAATGGTC 60 
ACCGTCTCTT CAGGGAGTGC ATCC 84 

(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 
TACTGTGCGG AAATAGCAGC AGCTGCCCTA CTTTGACTAC TGGGGCCAGG GAACCCTGGT 6 0 

CACCGTCTCC TCAGGGAGTG CATCC 8 5 

(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 
TACTGTGTGT GTATAGCAGC AGCTGGTAAA GGAAACGGCT ACTGGGGCCA GGGAACCCTG 6 0 

GTCACCGTCT CCTCAGGGAG TGCATCC 8 7 



(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 base pairs 

(B) TYPE: nucleic acid 

( C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 
TACTGTGCGA GACAAAACTG GGGTGACTAC TGGGGCCAGG GAACCCTGGT CACCGTCTCC 6 0 

TCAGGGAGTG CATCC 7 5 

(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 

Tyr Cys Ala Arg Arg Leu Thr Gly Val Asp Ala Phe Asp lie Trp Gly 
15 10 15 

Gin Gly Thr Met Val Thr Met Ser Ser Ala Ser Thr Lys 
20 25 

(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 

Tyr Cys Ala Arg His Arg He Ala Ala Ala Gly Phe Asp Tyr Trp Gly 
15 10 15 

Gin Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys 
20 25 

(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 

Tyr Cys Ala Arg Tyr Tyr Tyr Tyr Tyr Tyr Gly Met Asp Val Trp Gly 
15 10 15 

Gin Gly Thr Thr Val Thr Val Ser Ser Ala Ser Thr Lys 
20 25 
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(2) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 

Tyr Cys Ala Arg His Tyr Asp lie Leu Thr Gly Pro Thr Thr Thr Thr 
15 10 15 

Val Trp Thr Ser Gly Ala Lys Gly Pro Arg Ser Pro Ser Pro Gin Pro 
20 25 30 

Pro Pro 



(2) INFORMATION FOR SEQ ID NO:120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:120: 

Tyr Cys Ala Arg Arg Arg Tyr Tyr Gly Ser Gly Ser Tyr Tyr Asn Val 
15 10 15 

Phe Asp Tyr Trp Gly Gin Gly Thr Leu Val Thr Val Ser Ser Ala Asp 
20 25 30 

Thr Lys 



(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 

Tyr Cys Ala Arg Arg Gly Val Ser Asp Ala Phe Asp lie Trp Gly Gin 
15 10 15 

Gly Thr Met Val Thr Val Ser Ser Ala Asp Thr Lys 
20 25 
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(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:122: 

Tyr Cys Ala Arg Ala Thr Gly Ala Phe Asp He Trp Gly Gin Gly Thr 
1 5 10 15 

Met Val Thr Val Ser Ser Gly Ser Ala Ser 
20 25 

(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:123: 

Tyr Cys Ala Arg Ser Ala Asn Trp Gly Ser Tyr Tyr Tyr Tyr Gly Met 
15 10 15 

Asp Val Trp Gly Gin Gly Thr Thr Val Thr Val Ser Ser Gly Ser Ala 
20 25 30 

Ser 

(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 amino acids 

(B) TYPE : amino acid 

( C ) S TRANDEDNES S : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 

Tyr Cys Ala Arg Tyr Phe Gin His Trp Gly Gin Gly Thr Leu Val Thr 
15 10 15 

Val Ser Ser Gly Ser Ala Ser 
20 
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(2) INFORMATION FOR SEQ ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 

Tyr Cys Ala Arg His Val Ala Asn Ser Phe Asp Tyr Trp Gly Gin Gly 
15 10 15 

Thr Leu Val Thr Val Ser Ser Gly Ser Ala Ser 
20 25 



(2) INFORMATION FOR SEQ ID NO: 126: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:126: 

Tyr Cys Ala Arg Gin lie Thr Met Val Arg Gly Val Pro Phe Asp Tyr 
15 10 15 

Trp Gly Gin Gly Thr Leu Val Thr Val Ser Ser Gly Ser Ala Ser 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 12 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 

Tyr Cys Ala Arg Gin Tyr Phe Gin His Trp Gly Gin Gly Thr Leu Val 
15 10 15 

Thr Val Ser Ser Gly Ser Ala Ser 
20 
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(2) INFORMATION FOR SEQ ID NO: 12 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:128: 

Tyr Cys Ala Arg Gin Thr Gly Asp Tyr Tyr Tyr Tyr Gly Met Asp Val 
15 10 is 

Trp Gly Gin Gly Thr Thr Val Thr Val Ser Ser Gly Ser Ala Ser 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 12 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:129: 

Tyr Cys Ala Arg His Tyr Tyr Gly Ser Gly Ser Tyr Asp Tyr Tyr Tyr 
1 5 10 15 

Tyr Gly Met Asp Val Trp Gly Gin Gly Thr Thr Val Thr Val Ser Ser 
20 25 30 

Gly Ser Ala Ser 
35 



(2) INFORMATION FOR SEQ ID NO: 13 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:130: 

Tyr Cys Ala Arg Gin Gly Val Gly Pro Gly Asn Pro Gly His Arg Leu 
15 10 15 

Leu Ser Leu His Gin 
20 
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(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 

Tyr Cys Val Arg Phe Trp Glu Thr Gly Ser Thr Pro Gly Ala Arg Glu 
15 10 15 

Pro Trp Ser Pro Ser Pro Gin Gly Val His 
20 25 

(2) INFORMATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:132: 

Tyr Cys Ala Arg Arg Arg Tyr Tyr Gly Ser Gly Ser Tyr Tyr Asn Val 
15 10 15 

Phe Asp Tyr Trp Gly Gin Gly Thr Leu Val Thr Val Ser Ser Gly Ser 
20 25 30 

Thr Lys 

(2) INFORMATION FOR SEQ ID NO:133: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:133: 

Tyr Cys Ala Arg Gin Thr Trp Gly Gly Asp Tyr Trp Gly Gin Gly Thr 
15 10 15 

Leu Val Thr Val Ser Ser Gly Ser Thr Lys 
20 25 
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(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

( C ) S TRANDEDNES S : 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:134: 

Tyr Cys Ala Arg Gly Tyr Ser Gly Tyr Asp Asn Tyr Tyr Tyr Gly lie 
1 5 10 15 

His Val Trp Gly Gin Gly Thr Thr Val Thr Val Ser Ser Gly Ser Thr 
20 25 30 

Lys 



(2) INFORMATION FOR SEQ ID NO: 13 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 

(C) S TRANDEDNES S : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 13 5: 

Tyr Cys Ala Arg Gin Thr Gly Glu Asp Tyr Phe Asp Tyr Trp Gly Gin 
15 10 15 

Gly Thr Leu Val Thr Val Ser Ser Gly Ser Ala Ser 
20 25 



(2) INFORMATION FOR SEQ ID NO: 136: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 

Tyr Cys Ala Arg Tyr Ser Gly Tyr Asp Tyr Leu Leu Val Leu Arg Ser 
15 10 15 

Leu Gly Pro Trp His Pro Gly His Cys Leu Leu Ser Leu His Arg 
20 25 30 
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(2) INFORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 7: 

Tyr Cys Ala Arg Ala Ser Leu Pro Ser Phe Asp Tyr Tyr Gly Met Asp 
15 10 15 

Val Trp Gly Gin Gly Thr Thr Val Thr Val Ser Ser Gly Ser Thr Lys 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 13 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 

Tyr Cys Ala Arg Arg Gly Gly Gly Leu Thr Thr Gly Ala Arg Glu Pro 
15 10 15 

Trp Ser Pro Ser Pro Gin Gly Val His 
20 25 



(2) INFORMATION FOR SEQ ID NO: 13 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 9: 

Tyr Cys Val Pro Val Glu Thr Leu Leu Leu Leu Leu Arg Tyr Gly Arg 
15 10 15 

Leu Gly Pro Arg Asp His Gly His Arg Leu Leu Arg Glu Cys lie 
20 25 30 
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(2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:140: 

Tyr Cys Val Arg Asp lie Leu Thr Gly Glx Arg Asp Tyr Trp Gly Gin 
15 10 15 

Gly Thr Leu Val Thr Val Ser Ser Gly Ser Ala Ser 
20 25 

(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 

Tyr Cys Ala Arg His Gly lie Ala Ala Ala Gly Thr Ala Phe Asp lie 
15 10 15 

Trp Gly Gin Gly Thr Met Val Thr Val Ser Ser Gly Ser Ala Ser 
20 25 30 

(2) INFORMATION FOR SEQ ID NO:142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 

Tyr Cys Val Arg Ser Thr Gly Val Asp Ala Phe Asp He Trp Gly Gin 
15 10 15 

Gly Thr Met Val Thr Val Ser Ser Gly Ser Ala Ser 
20 25 

(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:143: 

Tyr Cys Ala Glu lie Ala Ala Ala Ala Leu Leu Leu Glx Leu Leu Gly 
15 10 15 

Pro Gly Asn Pro Gly His Arg Leu Leu Arg Glu Cys lie 
20 25 



(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 

Tyr Cys Val Cys lie Ala Ala Ala Gly Lys Gly Asn Gly Tyr Trp Gly 
15 10 15 

Gin Gly Thr Leu Val Thr Val Ser Ser Gly Ser Ala Ser 
20 25 

(2) INFORMATION FOR SEQ ID NO: 14 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 

Tyr Cys Ala Arg Gin Asn Trp Gly Asp Tyr Trp Gly Gin Gly Thr Leu 
15 10 15 

Val Thr Val Ser Ser Gly Ser Ala Ser 
20 25 

(2) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 
TCAGTGAAGG TTTCCTGCAA GGCATCTGGA TACACCTTCA CC 
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(2) INFORMATION FOR SEQ ID NO: 14 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 
TCCCTGAGAC TCTCCTGTGC AGCCTCTGGA TTCACCTTCA GT 42 



(2) INFORMATION FOR SEQ ID NO: 148: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:148: 
GGCCGCATCC CGGGTCTCGA GGTCGACAAG CTTTCGAGGA TCCGC 4 5 



(2) INFORMATION FOR SEQ ID NO: 149: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 9: 
GGCCGCGGAT CCTCGAAAGC TTGTCGACCT CGAGACCCGG GATGC 45 



(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 
GGCCGCTGTC GACAAGCTTA TCGATGGATC CTCGAGTGC 



39 
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(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 
GGCCGCACTC GAGGATC CAT CGATAAGCTT GTCGACAGC 3 9 

(2) INFORMATION FOR SEQ ID NO: 152: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:152: 
CACCTTCGGC CAAGGGACAC GACTGGAGAT TAAACGTAAG CA 42 

(2) INFORMATION FOR SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:153: 
AGGTTCAGTG GCAGTGGGTC TGGGACAGAC TTCACTCTCA CCATCAGC 48 

(2) INFORMATION FOR SEQ ID NO : 154 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 
GATCCTCGAG ACCAGGTACC AGATCTTGTG AATTCG 



36 
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(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 
TCGACGAATT CACAAGATCT GGTACCTGGT CTCGAG 



(2) INFORMATION FOR SEQ ID NO:156: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 
CGCGGTACCG AGAGTCAGTC CTTCCCAAAT GTC 



(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 
CGCCTCGAGA CAGCTGGAAT GGGCACATGC AGA 



(2) INFORMATION FOR SEQ ID NO:158: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
CGCGGTACCG CTGATGCTGC ACCAACTGTA TCC 



(2) INFORMATION FOR SEQ ID NO: 159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159 
CGCCTCGAGC TAACACTCAT TCCTGTTGAA GCT 



(2) INFORMATION FOR SEQ ID NO: 160: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160 
GGCGCTCGAG CTGGACAGGG MTCCAKAGTT CCA 



(2) INFORMATION FOR SEQ ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161 
CCACACTCTG CATGCTGCAG AAGCTTTTCT GTA 



(2) INFORMATION FOR SEQ ID NO: 162: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162 
GGTGACTGAG GTACCTTGAC CCCAGTAGTC CAG 
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(2) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: 
GGTTACCTCA GTCACCGTCT CCTCAGAGGT AAGAATGGCC TC 42 



(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 
AGGCTCCACC AGACCTCTCT AGACAGCAAC TAC 33 



(2) INFORMATION FOR SEQ ID NO: 165: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 
TGGGGTCAAG GAACCTCAGT CACCGTCTCC TCAGGTAAGA ATGGCCTCTC C 51 



(2) INFORMATION FOR SEQ ID NO: 166: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

( C ) S TRANDEDNES S : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166: 

Trp Gly Gin Gly Thr Ser Val Thr Val Ser Ser 
15 10 
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(2) INFORMATION FOR SEQ ID NO: 167: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

Cii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:167: 
TGGGGTCAAG GTACCTCAGT CACCGTCTCC TCAGAGGTAA GAATGGCCTC TCC 53 



(2) INFORMATION FOR SEQ ID NO: 16 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168: 

Trp Gly Gin Gly Thr Ser Val Thr Val Ser Ser Glu 
15 10 



(2) INFORMATION FOR SEQ ID NO: 169: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169: 
CTGGTCCTCA GAGAGTCAGT CCTTCCCAAA TGTCTTCCCC CTCGTC 46 



(2) INFORMATION FOR SEQ ID NO: 170: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170: 

Glu Ser Gin Ser Phe Pro Asn Val Phe Pro Leu Val 
15 10 



(2) INFORMATION FOR SEQ ID NO: 171: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 
CTGGTCCTCA GAGTCAGTCC TTCCCGAATG TCTTCCCCCT CGTC 



(2) INFORMATION FOR SEQ ID NO: 172: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:172: 

Ser Gin Ser Phe Pro Asn Val Phe Pro Leu Val 
15 10 



(2) INFORMATION FOR SEQ ID NO: 173: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:173: 
TAGAAGGAAT TCAGCAGGCA CACAACAGAG GCAGTTCCA 



(2) INFORMATION FOR SEQ ID NO : 174 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 38 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174: 
AGCTTCTCGA GCTCCTGCTG CTCTGTTTCC CAGGTGCC 
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(2) INFORMATION FOR SEQ ID NO: 175: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175: 
CAGCTTCTCG AGCTCCTGCT ACTCTGGCTC MCAGATACC 



(2) INFORMATION FOR SEQ ID NO: 176: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176: 
TATTACTGTG CGAGGGCTCC AACTGGGGAC TGGTTCGAC 



(2) INFORMATION FOR SEQ ID NO: 17 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177: 

Tyr Tyr Cys Ala Arg Ala Pro Thr Gly Asp Trp Phe Asp 
15 10 



(2) INFORMATION FOR SEQ ID NO: 178: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:178: 
TATAATAGTT ACCCTCCTAC TTTCGGC 
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(2) INFORMATION FOR SEQ ID NO: 179: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179: 

Tyr Asn Ser Tyr Pro Pro Thr Phe Gly 
1 5 



(2) INFORMATION FOR SEQ ID NO:180: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180: 
GGCGCGCCTT GGCCTAAGAG GCCA 



(2) INFORMATION FOR SEQ ID NO: 181: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181: 
CCTCTTAGGC CAAGGCGCGC CTGG 



(2) INFORMATION FOR SEQ ID NO: 182: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182 
AATTCAGTAT CGATGTGGTA C 
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(2) INFORMATION FOR SEQ ID NO:183: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183: 
C AC AT CGATA CTG 



(2) INFORMATION FOR SEQ ID NO : 184 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184: 
GTTTGCAGGT GTCCAGTGTS AGGTGCAGCT GKTGGAGTCY SG 



(2) INFORMATION FOR SEQ ID NO: 18 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185: 
CCGGTCGACC CG 



(2) INFORMATION FOR SEQ ID NO:186: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186: 
TCACAAGCCC AGCAACACCA AG 
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(2) INFORMATION FOR SEQ ID NO: 187: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187: 
AAAAGCCAGA AGACCCTCTC CCTG 



(2) INFORMATION FOR SEQ ID NO:188: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188: 
CAATAGGGGT CATGGACCC 



(2) INFORMATION FOR SEQ ID NO: 18 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 9: 
TCATTCTGTG CAGAGTTGGC 



(2) INFORMATION FOR SEQ ID NO: 190: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190: 
GTCCAGAATT CGGTBCAGCT GGTGSAGTCT GG 
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(2) INFORMATION FOR SEQ ID NO: 191: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191 
GGTTTCTCGA GGAAGAGGAA GACTGACGGT CC 



(2) INFORMATION FOR SEQ ID NO: 192: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE : DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192 
GACATCCAGC TGACCCAGTC TCC 



(2) INFORMATION FOR SEQ ID NO: 193: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193 
GATATTCAGC TGACTCAGTC TCC 



(2) INFORMATION FOR SEQ ID NO: 194: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194 
GAAATTCAGC TGACGCAGTC TCC 



(2) INFORMATION FOR SEQ ID NO: 195: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195: 
GAAACGCAGC TGACGCAGTC TCCC 



(2) INFORMATION FOR SEQ ID NO: 196: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:196: 
GCAAGCTTCT GTCCCAGACC CACTGCCACT GAACC 



(2) INFORMATION FOR SEQ ID NO: 197: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197: 
CGGTTAACAT AGCCCTGGGA CGAGAC 



(2) INFORMATION FOR SEQ ID NO: 198: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198 
GGGTTAACTC ATTGCCTCCA AAGCAC 
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(2) INFORMATION FOR SEQ ID NO: 199: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 413 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199 








TGCACAAGAA 


CATGAAACAC 


CTGTGGTTCT 


TCCTCCTCCT 


GGTGGCAGCT 


CCCAGATGGG 


60 


TCCTGTCCCA 


GGTGCAGCTT 


CATCAGTGGG 


GCGCAGGACT 


GTTGAAGCCT 


TCGGAGACCC 


120 


TGTCCCTCAC 


CTGCGCTGTC 


TATGGTGGGT 


CCTTCAGTGG 


TTACTTCTGG 


AGCTGGATCC 


180 


GCCAGCCCCC 


AGGGAGGGGG 


CTGGAGTGGA 


TTGGGGAAAT 


CCATCATCGT 


GGAAGCACCA 


240 


ACTACAACCC 


GTCCCTCGAG 


AGTCGAGTCA 


CCCTATCAGT 


AGACACGTCC 


AAAAACCAGT 


300 


TCTCCCTGAG 


GCTGAGTTCT 


GTGACCGCCG 


CGGACACGGC 


TGTGTATTAC 


TGTGCGAGAG 


360 


ACATTACTAT 


GGTTCGGGGA 


GTACCTCACT 


GGGGCCAGGG 


AACCCTGGTC 


ACC 


413 



(2) INFORMATION FOR SEQ ID NO:200: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 00: 
GACAGACTTC ACTCTCACCA TCAGCAGACT GGAGCCTGAA GATTTTGCAG TGTATTACTG 
TCAGCAGTAT GGTAGCTCAC CCCTCACTTT CGGCGGAGGG AC C AAGGTGG AGATCAAACG 
AACTGTGGCG GCACCATCTG TCTTCATCTT CCC 

(2) INFORMATION FOR SEQ ID NO: 2 01: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 414 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 01: 
TCCACCATCA TGGGGTCAAC CGCCATCCTC GCCCTCCTCC TGGCTGTTCT CCAAGGAGTC 
TGTGCCGAGG TGCAGCTGGT GCAGTCTGGA GCAGAGGTGA AAAAGCCCGG GGAGTCTCTG 
AAGATCTCCT GTAAGGGTTC TGGATACAGC TTTACCAGTT ACTGGATCGC CTGGGTGCGC 
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CAGATGCCCG GGAAAGGCCT GGAGTGGATG GGGATCATCG ATCCTGCTGA CTCTGATACC 
AGATACAACC CGTCCTTCCA AGGCCAGGTC ACCATCTGAG CCGACAAGTC CATCAGTACC 
GCCTATTTGC AGTGGAGCAG CCTGAAGGCC TCGGACACCG CCATGTATTA CTGTGCGAGA 
CCAGCGAACT GGAACTGGTA CTTCGTTCTC TGGGGCCGTG GCACCCTGGT CACT 

(2) INFORMATION FOR SEQ ID NO:202: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 156 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202: 
GACAGATTTC ACTCTCACCA TCAGCAGCCT GCAGCCTGAA GATTTTGCAA CTTATTACTG 
TCAACAGTTT ATTAGTTACC CTCAGCTCAC TTTCGGCGGA GGGACCAGGG TGGAGATCAA 
ACGAACTGTG GCTGCACCAT CTGTCTTCAT CTTCCC 

(2) INFORMATION FOR SEQ ID NO: 2 03: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 04 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203: 



TGCACAAGAA 


CATGAAACAC 


CTGTGGTTCT 


TCCTCCTCCT 


GGTGGCAGCT 


CCCAGATGGG 


60 


TCCTGTCCCA 


GGTGCAGCTA 


CAGCAGTGGG 


GCGCAGGACT 


GTTGAAGCCT 


TCGGAGACCC 


120 


TGTCCCTCAC 


CTGCGCTGTC 


TATGGTGGGT 


CCTTCAGTGG 


TTACTACTGG 


AGCTGGATCC 


180 


GCCAGCCCCC 


AGGTAAGGGG 


CTGGAGTGGA 


TTGGGGAAAT 


CAATCATAGT 


GGAAGCACCA 


240 


ACTACAACCC 


GTCCCTCAAG 


AGTCGAGTCA 


C CAT AT C AGT 


CGACACGTCC 


AAGAAC CAGT 


300 


TCTCCCTGAA 


ACTGAGCTCT 


GTGACCGCCG 


CGGACACGGC 


TGTGTATTAC 


TGTGCGAGAG 


360 


TAATTAATTG 


GTTCGACCCC 


TGGGGCCAGG 


GAACCCTGGT 


CACC 




404 



(2) INFORMATION FOR SEQ ID NO: 2 04: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



240 
300 
360 
414 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204: 
GACAGATTTC ACTCTCACCA TCAGCAGCCT GCAGCCTGAA GATTTTGCAA CTTACTATTG 6 0 

TCAACAGGCT AATAGTTTCC CGTACACTTT TGGCCAGGGG ACCAAGCTGG AGATCAAACG 12 0 

AACTGTGGCT GCACCATCTG TCTTCATCTT CCC 153 

(2) INFORMATION FOR SEQ ID NO:205: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 03 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205: 

ATGAAAC AC C TGTGGTTCTT CCTCCTCCTG GTGGCAGCTC CCAGATGGGT CCTGTCCCAG 6 0 

GTGCAGCTAC AGCAGTGGGG CGCAGGACTG TTGAAGCCTT CGGAGACCCT GTCCCTCACC 12 0 

TGCGCTGTCT ATGGTGGGTC CTTCAGTGGT TACTACTGGA GCTGGATCCG CCAGCCCCCA 18 0 

GGTAAGGGGC TGGAGTGGAT TGGGGAAATC AATCATAGTG GAAGCAC CAA CTACAACCCG 24 0 

TCCCTCAAGA GTCGAGTCAC CATATCAGTC GACACGTCCA AGAACCAGTT CTCCCTGAAG 3 00 

CTGAGCTCTG TGACCGCCGC GGACACGGCT GTGTATTACT GTGCGAGAGT AATTAATTGG 36 0 

TTCGACCCCT GGGGCCAGGG AACCCTGGTC ACCGTCTCCT CAG 4 03 

(2) INFORMATION FOR SEQ ID NO: 2 06: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 388 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206: 

ATGGACATGA TGGTCCCCGC TCAGCTCCTG GGGCTCCTGC TGCTCTGGTT CCCAGGTTCC 60 

AGATGCGACA TCCAGATGAC CCAGTCTCCA TCTTCCGTGT CTGCATCTGT AGGAGACAGA 12 0 

GT CAC CATC A CTTGTCGGGC GAGTCAGGAT ATTAGCAGCT GGTTAGCCTG GTATCAGCAT 18 0 

AAACCAGGGA AAGCCCCTAA GCTCCTGATC TATGCTGCAT CCAGTTTGCA AAGTGGGGTC 240 

C CAT C AAGGT TCAGCGGCAG TGGATCTGGG ACAGATTTCA CTCTCACCAT CAGCAGCCTG 3 00 

CAGCCTGAAG ATTTTGCAAC TTACTATTGT CAACAGGCTA ATAGTTTCCC GTACACTTTT 3 60 

GGCCAGGGGA CCAAGCTGGA GATCAAAC 388 
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(2) INFORMATION FOR SEQ ID NO: 207: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 462 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207: 

ATGGGGTCAA CCGCCATCCT CGCCCTCCTC CTGGCTGTTC TCCAAGGAGT CTGTGCCGAG 6 0 

GTGCAGCTGG TGCAGTCTGG AGCAGAGGTG AAAAAGCCCG GGGAGTCTCT GAAGATCTCC 12 0 

TGTAAGGGTT CTGGATACAG CTTTACCGGC TACTGGATCG GCTGGGTGCG CCAGATGCCC 18 0 

GGGAAAGGCC TGGAGTGGAT GGGGATCATC TATCCTGGTG ACT CTGATAC CACATACAGC 24 0 

CCGTCCTTCC AAGGCCAGGT CACCATCTCA GCCGACAAGT CCATCAGCAC CGCCTACCTG 3 00 

CAGTGGAGCA GCCTGAAGGC CTCGGACACC GCCATGTATT ACTGTGCGAG AGACCAACTG 36 0 

GGCCTCTTTG ACTACTGGGG CCAGGGAACC CTGGTCACCG TCTCCTCAGC CTCCACCAAG 42 0 

GGCCCATCGG TCTTCCCCCT GGCACCCTCC TCCAAGAAGC TT 462 

(2) INFORMATION FOR SEQ ID NO:208: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208: 

ATGGACATGG AGTTCCCCGT TCAGCTC CTG GGGCTCCTGC TGCTCTGTTT CCCAGGTGCC 6 0 

AGATGTGACA TCCAGATGAC CCAGTCTCCA TCCTCACTGT CTGCATCTGT AGGAGACAGA 12 0 

GTCACCATCA CTTGTCGGGC GAGTCAGGGT ATTAGCAGCT GGTTAGCCTG GTAT CAGC AG 18 0 

AAACCAGAGA AAGCCCCTAA GTCCCTGATC TATTCTGCAT CCAGTTTGCA AAGTGGGGTC 24 0 

CCATCAAGGT TCAGCGGCAG TGGATCTGGG AC AGATTT C A CTCTCACCAT CAGCAGCCTG 3 00 

CAGCCTGAAG ATTTTGCAAC TTATTACTGC CAACAGTATG ATAGTTACCC GTACACTTTT 36 0 

GGC CAGGGGA CCAAGCTGGA GAT C AAACG A ACTGTGGCTG CACCATCTGT CTTCATCTTC 42 0 

CCGCCATCTG ATGAAGCTT 439 

(2) INFORMATION FOR SEQ ID NO: 209: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY : linear 
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(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 09: 

Asp He Thr Met Val Arg Gly Val Pro His 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 210: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:210: 

Gin Gin Tyr Gly Ser Ser Pro Leu Thr 
1 5 



(2) INFORMATION FOR SEQ ID NO: 211: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211: 

Pro Ala Asn Trp Asn Trp Tyr Phe Val Leu 
15 10 

(2) INFORMATION FOR SEQ ID NO: 212: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 12 

Gin Gin Phe He Ser Tyr Pro Gin Leu Thr 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 213: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY : linear 
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(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213 

Val lie Asn Trp Phe Asp Pro 
1 5 



(2) INFORMATION FOR SEQ ID NO: 214: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
{ C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214 

Gin Gin Ala Asn Ser Phe Pro Tyr Thr 
1 5 



(2) INFORMATION FOR SEQ ID NO: 215: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
CO STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215 

Asp Gin Leu Gly Leu Phe Asp Tyr 
1 5 



(2) INFORMATION FOR SEQ ID NO: 216: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216 

Gin Gin Tyr Asp Ser Tyr Pro Tyr Thr 
1 5 



(2) INFORMATION FOR SEQ ID NO: 217: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 81 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

( D ) TOPOLOGY : linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217: 

TCTTCCGCTT CCTCGCTCAC TGACTCGCTG CGCTCGGTCG TTCGGCTGCG GCGAGCGGTA 60 

TCAGCTCACT CAAAGGCGGT AATACGGTTA TCCACAGAAT CAGGGGATAA CGCAGGAAAG 12 0 

AACATGTGAG CAAAAGGCCA GCAAAAGGCC AGGAACCGTA AAAAGGCCGC GTTGCTGGCG 180 

TTTTTCCATA GGCTCCGCCC CCCTGACGAG CATCACAAAA ATCGACGCTC AAGT CAGAGG 240 

TGGCGAAACC CGACAGGACT ATAAAGATAC CAGGCGTTTC CCCCTGGAAG CTCCCTCGTG 3 00 

CGCTCTCCTG TTCCGACCCT GCCGCTTACC GGATACCTGT CCGCCTTTCT CCCTTCGGGA 360 

AGCGTGGCGC TTTCTCATAG CTCACGCTGT AGGTATCTCA GTTCGGTGTA GGTCGTTCGC 42 0 

TCCAAGCTGG GCTGTGTGCA CGAACCCCCC GTTCAGCCCG ACCGCTGCGC CTTATCCGGT 48 0 

AACTATCGTC TTGAGTCCAA CCCGGTAAGA CACGACTTAT CGCCACTGGC AGCAGCCACT 54 0 

GGTAACAGGA TTAGCAGAGC GAGGTATGTA GGCGGTGCTA CAGAGTTCTT GAAGTGGTGG 6 00 

CCTAACTACG GCTACACTAG AAGGACAGTA TTTGGTATCT GCGCTCTGCT GAAGC CAGTT 660 

ACCTTCGGAA AAAGAGTTGG TAGCTCTTGA TCCGGCAAAC AAACCACCGC TGGTAGCGGT 72 0 

GGTTTTTTTG TTTGCAAGCA GCAGATTACG CGCAGAAAAA AAGGATCTCA AGAAGAT C CT 780 

TTGATCTTTT CTACGGGGTC TGACGCTCAG TGGAACGAAA ACTCACGTTA AGGGATTTTG 840 

GTCATGAGAT TATCAAAAAG GATCTTCACC TAGATCCTTT TAAATTAAAA ATGAAGTTTT 900 

AAATCAATCT AAAGTATATA TGAGTAAACT TGGTCTGACA GTTACCAATG CTTAATCAGT 960 

GAGGCACCTA TCTCAGCGAT CTGTCTATTT CGTTCATCCA TAGTTGCCTG ACTCCCCGTC 102 0 

GTGTAGATAA CTACGATACG GGAGGGCTTA CCATCTGGCC CCAGTGCTGC AATGATACCG 1080 

CGAGACCCAC GCTCACCGGC TCCAGATTTA TCAGCAATAA ACCAGCCAGC CGGAAGGGCC 1140 

GAGCGCAGAA GTGGTCCTGC AACTTTATCC GCCTCCATCC AGTCTATTAA TTGTTGCCGG 12 00 

GAAGCTAGAG TAAGTAGTTC GCCAGTTAAT AGTTTGCGCA ACGTTGTTGC CATTGCTACA 1260 

GGCATCGTGG TGTCACGCTC GTCGTTTGGT ATGGCTTCAT TCAGCTCCGG TTCCCAACGA 1320 

TCAAGGCGAG TTACATGATC CCCCATGTTG TGCAAAAAAG CGGTTAGCTC CTTCGGTCCT 138 0 

CCGATCGTTG TCAGAAGTAA GTTGGCCGCA GTGTTATCAC TCATGGTTAT GGCAGCACTG 1440 

CATAATTCTC TTACTGTCAT GCCATCCGTA AGATGCTTTT CTGTGACTGG TGAGTACTCA 1500 

ACCAAGTCAT TCTGAGAATA GTGTATGCGG CGACCGAGTT GCTCTTGCCC GGCGTCAATA 156 0 

CGGGATAATA CCGCGCCACA TAGCAGAACT TTAAAAGTGC TCATCATTGG AAAACGTTCT 162 0 

TCGGGGCGAA AACTCTCAAG GATCTTACCG CTGTTGAGAT C CAGTT CGAT GTAACCCACT 1680 

CGTGCACCCA ACTGATCTTC AGCATCTTTT ACTTTCACCA GCGTTTCTGG GTGAGCAAAA 174 0 

ACAGGAAGGC AAAATGCCGC AAAAAAGGGA ATAAGGGCGA CACGGAAATG TTGAATACTC 18 00 

ATACTCTTCC TTTTTCAATA TTATTGAAGC ATTTATCAGG GTTATTGTCT CATGAGCGGA 1860 
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TACATATTTG AATGTATTTA GAAAAATAAA CAAATAGGGG TTCCGCGCAC ATTTCCCCGA 192 0 

AAAGTGCCAC CTGACGTCTA AGAAACCATT ATTATCATGA CATTAACCTA TAAAAATAGG 1980 

CGTATCACGA GGCCCTTTCG TCTCGCGCGT TTCGGTGATG ACGGTGAAAA CCTCTGACAC 2 040 

ATGCAGCTCC CGGAGACGGT CACAGCTTGT CTGTAAGCGG ATGC CGGGAG CAGACAAGCC 2100 

CGTCAGGGCG CGTCAGCGGG TGTTGGCGGG TGTCGGGGCT GGCTTAACTA TGCGGCATCA 2160 

GAGCAGATTG TACTGAGAGT GCACCATATG CGGTGTGAAA TACCGCACAG ATGCGTAAGG 2 22 0 

AG AAAAT AC C GCATCAGGCG CCATTCGCCA TTCAGGCTGC GCAACTGTTG GGAAGGGCGA 228 0 

TCGGTGCGGG CCTCTTCGCT ATTACGCCAG CTGGCGAAAG GGGGATGTGC TGCAAGGCGA 2 34 0 

TTAAGTTGGG TAACGCCAGG GTTTTCCCAG TCACGACGTT GTAAAACGAC GGCCAGTGCC 2400 

AAGCTAGCGG CCGCGGTCCA ACCACCAATC TCAAAGCTTG GTACCCGGGA GCCTGTTATC 2460 

CCAGCACAGT CCTGGAAGAG GCACAGGGGA AATAAAAGCG GACGGAGGCT TTCCTTGACT 252 0 

CAGCCGCTGC CTGGTCTTCT TCAGACCTGT TCTGAATTCT AAACTCTGAG GGGGTCGGAT 2 580 

GACGTGGCCA TTCTTTGCCT AAAGCATTGA GTTTACTGCA AGGTCAGAAA AGC ATGC AAA 2 640 

GCCCTCAGAA TGGCTGCAAA GAGCTCCAAC AAAACAATTT AGAACTTTAT TAAGGAATAG 2 7 00 

GGGGAAGCTA GGAAGAAACT CAAAACATCA AGATTTTAAA TACGCTTCTT GGTCTCCTTG 2 76 0 

CTATAATTAT CTGGGATAAG CATGCTGTTT TCTGTCTGTC CCTAACATGC CCTGTGATTA 2 82 0 

TCCGCAAACA ACACACCCAA GGGCAGAACT TTGTTACTTA AACACCATCC TGTTTGCTTC 2880 

TTTCCTCAGG AACTGTGGCT GCACCATCTG TCTTCATCTT CCCGCCATCT GATGAGCAGT 2 94 0 

TGAAATCTGG AACTGCCTCT GTTGTGTGCC TGCTGAATAA CTTCTATCCC AGAGAGGCCA 3000 

AAGTACAGTG GAAGGTGGAT AACGCCCTCC AATCGGGTAA CTCCCAGGAG AGTGTCACAG 3 060 

AGCAGGACAG CAAGGACAGC ACCTACAGCC TCAGCAGCAC CCTGACGCTG AGCAAAGCAG 312 0 

ACTACGAGAA ACACAAAGTC TACGCCTGCG AAGTCACCCA TCAGGGCCTG AGCTCGCCCG 3180 

TCACAAAGAG CTTCAACAGG GGAGAGTGTT AGAGGGAGAA GTGCCCCCAC CTGCTCCTCA 324 0 

GTTCCAGCCT GACCCCCTCC CATCCTTTGG CCTCTGACCC TTTTTCCACA GGGGACCTAC 33 00 

CCCTATTGCG GTCCTCCAGC TCATCTTTCA CCTCACCCCC CTCCTCCTCC TTGGCTTTAA 336 0 

TTATGCTAAT GTTGGAGGAG AATGAATAAA TAAAGTGAAT CTTTGCACCT GTGGTTTCTC 3420 

TCTTTCCTCA ATTTAATAAT TATTATCTGT TGTTTACCAA CTACTCAATT TCTCTTATAA 348 0 

GGGACTAAAT ATGTAGTCAT CCTAAGGCGC ATAACCATTT AT AAAAAT C A TCCTTCATTC 3 54 0 

TATTTTACCC TATCATCCTC TGCAAGACAG TCCTCCCTCA AACC CACAAG CCTTCTGTCC 360 0 

TCACAGTCCC CTGGGCCATG GATCCTCACA TCCCAATCCG CGGCCGCAAT TCGTAAT CAT 36 6 0 

GGTCATAGCT GTTTCCTGTG TGAAATTGTT ATCCGCTCAC AATTCCACAC AACATACGAG 3 72 0 

CCGGAAGCAT AAAGTGTAAA GCCTGGGGTG CCTAATGAGT GAGCTAACTC ACATTAATTG 37 8 0 

CGTTGCGCTC ACTGCCCGCT TTCCAGTCGG GAAACCTGTC GTGCCAGCTG CATTAATGAA 384 0 

TCGGCCAACG CGCGGGGAGA GGCGGTTTGC GTATTGGGCG C 3881 
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(2) INFORMATION FOR SEQ ID NO: 218: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4723 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

<ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:218: 

GAACTCGAGC AGCTGAAGCT TTCTGGGGCA GGCCAGGCCT GACCTTGGCT TTGGGGCAGG 6 0 

GAGGGGGCTA AGGTGAGGCA GGTGGCGCCA GCCAGGTGCA CACCCAATGC CCATGAGCCC 12 0 

AGACACTGGA CGCTGAACCT CGCGGACAGT TAAGAACCCA GGGGCCTCTG CGCCCTGGGC 18 0 

CCAGCTCTGT CCCACACCGC GGTCACATGG CACCACCTCT CTTGCAGCCT CCACCAAGGG 240 

CCCATCGGTC TTCCCCCTGG CACCCTCCTC CAAGAGC AC C TCTGGGGGCA CAGCGGCCCT 300 

GGGCTGCCTG GTCAAGGACT ACTTCCCCGA ACCGGTGACG GTGTCGTGGA ACTCAGGCGC 36 0 

CCTGACCAGC GGCGTGCACA CCTTCCCGGC TGTCCTACAG TCCTCAGGAC TCTACTCCCT 420 

CAGCAGCGTG GTGACCGTGC CCTCCAGCAG CTTGGGCACC CAGACCTACA TCTGCAACGT 48 0 

GAATCACAAG CCCAGCAACA CCAAGGTGGA CAAGAAAGTT GGTGAGAGGC CAGCACAGGG 540 

AGGGAGGGTG TCTGCTGGAA GCCAGGCTCA GCGCTCCTGC CTGGACGCAT CCCGGCTATG 6 00 

CAGCCCCAGT CCAGGGCAGC AAGGCAGGCC CCGTCTGCCT CTTCACCCGG AGGCCTCTGC 660 

CCGCCCCACT CATGCTCAGG GAGAGGGTCT TCTGGCTTTT TCCCCAGGCT CTGGGCAGGC 72 0 

ACAGGCTAGG TGCCCCTAAC CCAGGCCCTG CACACAAAGG GGCAGGTGCT GGGCTCAGAC 780 

CTGC CAAGAG CCATATCCGG GAGGACCCTG CCCCTGACCT AAGCCCACCC CAAAGGCCAA 840 

ACTCTCCACT CCCTCAGCTC GGACACCTTC TCTCCTCCCA GATTCCAGTA ACTCCCAATC 900 

TTCTCTCTGC AGAGCCCAAA TCTTGTGACA AAACT CACAC ATGCCCACCG TGCCCAGGTA 960 

AGCCAGCCCA GGCCTCGCCC TCCAGCTCAA GGCGGGACAG GTGCCCTAGA GTAGCCTGCA 102 0 

TCCAGGGACA GGCCCCAGCC GGGTGCTGAC ACGTCCACCT CCATCTCTTC CTCAGCACCT 108 0 

GAACTCCTGG GGGGACCGTC AGTCTTCCTC TTCCCCCCAA AACCCAAGGA CACCCTCATG 1140 

ATCTCCCGGA CCCCTGAGGT CACATGCGTG GTGGTGGACG TGAGCCACGA AGACCCTGAG 12 00 

GTCAAGTTCA ACTGGTACGT GGACGGCGTG GAGGTGCATA ATGCCAAGAC AAAGCCGCGG 1260 

GAGGAGCAGT ACAACAGCAC GTACCGTGTG GTCAGCGTCC TCACCGTCCT GCACCAGGAC 132 0 

TGGCTGAATG GCAAGGAGTA CAAGTGCAAG GTCTCCAACA AAGCCCTCCC AGCCCCCATC 13 80 

GAGAAAACCA TCTCCAAAGC CAAAGGTGGG ACCCGTGGGG TGCGAGGGCC ACATGGACAG 1440 

AGGCCGGCTC GGCCCACCCT CTGCCCTGAG AGTGACCGCT GTACCAACCT CTGTCCCTAC 1500 

AGGGCAGCCC CGAGAAC CAC AGGTGTACAC CCTGCCCCCA TCCCGGGATG AGCTGACCAA 1560 

GAACCAGGTC AGCCTGACCT GCCTGGTCAA AGGCTTCTAT CCCAGCGACA TCGCCGTGGA 162 0 
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GTGGGAGAGC AATGGGCAGC CGGAGAACAA CTACAAGACC ACGCCTCCCG TGCTGGACTC 1680 

CGACGGCTCC TTCTTCCTCT ACAGCAAGCT CACCGTGGAC AAGAGCAGGT GGCAGCAGGG 1740 

GAACGTCTTC TCATGCTCCG TGATGCATGA GGCTCTGCAC AACCACTACA CGCAGAAGAG 1800 

CCTCTCCCTG TCTCCGGGTA AATGAGTGCG ACGGCCGGCA AGCCCCCGCT CCCCGGGCTC 1860 

TCGCGGTCGC ACGAGGATGC TTGGCACGTA CCCCCTGTAC ATACTTCCCG GGCGCCCAGC 1920 

ATGGAAATAA AGCACCCAGC GCTGCCCTGG GCCCCTGCGA GACTGTGATG GTTCTTTCCA 1980 

CGGGTCAGGC CGAGTCTGAG GCCTGAGTGG CATGAGGGAG GCAGAGCGGG TCCCACTGTC 2040 

CCCACACTGG CCCAGGCTGT GCAGGTGTGC CTGGGCCCCC TAGGGTGGGG CTCAGCCAGG 2100 

GGCTGCCCTC GGCAGGGTGG GGGATTTGCC AGCGTGGCCC TCCCTCCAGC AGCACCTGCC 2160 

CTGGGCTGGG CCACGGGAAG CCCTAGGAGC CCCTGGGGAC AGACACACAG CCCCTGCCTC 2220 

TGTAGGAGAC TGTCCTGTTC TGTGAGCGCC CCTGTCCTCC CGACCTCCAT GCCCACTCGG 228 0 

GGGCATGCCT GCAGGTCGAC TCTAGAGGAT CCCCGGGTAC CGAGCTCGAA TTCATCGATG 2340 

ATATCAGATC TGCCGGTCTC CCTATAGTGA GTCGTATTAA TTTCGATAAG CCAGGTTAAC 2400 

CTGCATTAAT GAATCGGCCA ACGCGCGGGG AGAGG CGGTT TGCGTATTGG GCGCTCTTCC 2460 

GCTTCCTCGC TCACTGACTC GCTGCGCTCG GTCGTTCGGC TGCGGCGAGC GGTATCAGCT 2520 

CACTCAAAGG CGGTAATACG GTTATC C AC A GAATCAGGGG ATAACGCAGG AAAGAACATG 2580 

TGAGCAAAAG GCCAGCAAAA GGCCAGGAAC CGTAAAAAGG CCGCGTTGCT GGCGTTTTTC 2640 

CATAGGCTCC GCCCCCCTGA CGAGCATCAC AAAAAT CG AC GCTCAAGTCA GAGGTGGCGA 2700 

AACCCGACAG GACTATAAAG ATACCAGGCG TTTCCCCCTG GAAGCTCCCT CGTGCGCTCT 2760 

CCTGTTCCGA CCCTGCCGCT TACCGGATAC CTGTCCGCCT TTCTCCCTTC GGGAAGCGTG 2820 

GCGCTTTCTC AATGCTCACG CTGTAGGTAT CTCAGTTCGG TGTAGGTCGT TCGCTCCAAG 2880 

CTGGGCTGTG TGCACGAACC CCCCGTTCAG CCCGACCGCT GCGCCTTATC CGGTAACTAT 2940 

CGTCTTGAGT CCAACCCGGT AAGACACGAC TTATCGCCAC TGGCAGCAGC CACTGGTAAC 3 000 

AGGATTAGCA GAGCGAGGTA TGTAGGCGGT GCTACAGAGT TCTTGAAGTG GTGGCCTAAC 3 060 

TACGGCTACA CTAGAAGGAC AGTATTTGGT ATCTGCGCTC TGCTGAAGCC AGTTACCTTC 3120 

GGAAAAAGAG TTGGTAGCTC TTGATCCGGC AAACAAACCA CCGCTGGTAG CGGTGGTTTT 3180 

TTTGTTTGCA AGCAGCAGAT TACGCGCAGA AAAAAAGGAT CTCAAGAAGA TCCTTTGATC 3240 

TTTTCTACGG GGTCTGACGC TCAGTGGAAC GAAAACTCAC GTTAAGGGAT TTTGGTCATG 3300 

AGATTATCAA AAAGGATCTT CACCTAGATC CTTTTAAATT AAAAATGAAG TTTTAAATCA 3360 

ATCTAAAGTA TATATGAGTA AACTTGGTCT GACAGTTACC AATGCTTAAT CAGTGAGGCA 3420 

CCTATCTCAG CGATCTGTCT ATTTCGTTCA TCCATAGTTG CCTGACTCCC CGTCGTGTAG 3480 

ATAACTACGA TACGGGAGGG CTTACCATCT GGCCCCAGTG CTGCAATGAT ACCGCGAGAC 3540 

CCACGCTCAC CGGCTCCAGA TTTATCAGCA ATAAACCAGC CAGCCGGAAG GGCCGAGCGC 3600 

AGAAGTGGTC CTGCAACTTT ATCCGCCTCC ATCCAGTCTA TTAATTGTTG CCGGGAAGCT 3 660 
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AGAGTAAGTA GTTCGCCAGT TAATAGTTTG CGCAACGTTG TTGCCATTGC TACAGGCATC 372 0 

GTGGTGTCAC GCTCGTGGTT TGGTATGGCT TCATTCAGCT CCGGTTCCCA ACGATCAAGG 3 78 0 

CGAGTTACAT GATCCCCCAT GTTGTGCAAA AAAGCGGTTA GCTCCTTCGG TCCTCCGATC 384 0 

GTTGTCAGAA GTAAGTTGGC CGGAGTGTTA TCACTCATGG TTATGGCAGC ACTGCATAAT 3 900 

TCTCTTACTG TCATGCCATC CGTAAGATGC TTTTCTGTGA CTGGTGAGTA CTCAACCAAG 3 960 

TCATTCTGAG AATAGTGTAT GCGGCGACCG AGTTGCTCTT GCCCGGCGTC AATACGGGAT 4020 

AATACCGCGC CACATAGCAG AACTTTAAAA GTGCTCATCA TTGGAAAACG TTCTTCGGGG 4 08 0 

CGAAAACTCT CAAGGATCTT ACCGCTGTTG AGATCCAGTT CGATGTAACC CACTCGTGCA 4140 

CCCAACTGAT CTTCAGCATC TTTTACTTTC ACCAGCGTTT CTGGGTGAGC AAAAACAGGA 42 0 0 

AGGCAAAATG CCGCAAAAAA GGGAATAAGG GCGACACGGA AATGTTGAAT ACTCATACTC 42 6 0 

TTCCTTTTTC AATATTATTG AAGCATTTAT CAGGGTTATT GTCTCATGAG CGGATACATA 43 2 0 

TTTGAATGTA TTTAGAAAAA TAAACAAATA GGGGTTCCGC GCACATTTCC CCGAAAAGTG 4380 

CCACCTGACG TCTAAGAAAC CATTATTATC ATGACATTAA CCTATAAAAA TAGGCGTATC 444 0 

ACGAGGCCCT TTCGTCTCGC GCGTTTCGGT GATGACGGTG AAAACCTCTG ACACATGCAG 4500 

CTCCCGGAGA CGGTCACAGC TTGTCTGTAA GCGGATGCCG GGAGCAGACA AGCCCGTCAG 456 0 
GGCGCGTCAG CGGGTGTTGG CGGGTGTCGG GGCTGGCTTA ACTATGCGGC AT C AGAGC AG 
ATTGTACTGA GAGTGCACCA TATGGACATA TTGTCGTTAG AACGCGGCTA CAATTAATAC 
ATAACCTTAT GTATCATACA CATACGATTT AGGTGACACT ATA 

(2) INFORMATION FOR SEQ ID NO:219: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 524 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:219: 
AAGCTTGCCA C C ATGAAAC A CCTGTGGTTC TTCCTCCTCC TGGTGGCAGC TCCTAGATGG 6 0 

GTCCTGTCTC AGGTGCAGCT ACAGCAGTGG GGCGCAGGAC TGTTGAAGCC TTCGGAGACC 
CTGTCCCTCA CCTGCGCTGT CTATGGTGGT TCCTTCAGTG GTTACTACTG GAGCTGGATC 
CGCCAGCCAC CAGGTAAGGG TCTGGAGTGG ATTGGTGAAA TCAATCATAG TGGAAGCACC 240 
AACTACAACC CGTCTCTCAA GAGTCGAGTC AC C ATATCAG TAGACACGTC CAAGAACCAG 3 00 

TTCTCTCTGA AACTGAGCTC TGTGACCGCT GCGGACACGG CTGTGTATTA CTGTGCGAGA 
GTAATTAATT GGTTCGACCC TTGGGGCCAG GGAACCCTGG TCACCGTCTC CTCAGCCTCA 
ACCAAGGGCC CATCGGTCTT CCCCCTGGCA CCCTCCTCCA AGAGCAC CTC TGGGGGCACA 
GCGGCCCTGG GCTGCCTGGT CAAGGACTAC TTCCCCGAAC CGGT 



4620 
4680 
4723 



120 
180 



360 
420 
480 
524 
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(2) INFORMATION FOR SEQ ID NO: 220: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 0: 



AAGCTTGCCA 


CCATGATGGT 


CCCAGCTCAG 


CTCCTCGGTC 


TCCTGCTGCT 


CTGGTTCCCA 


60 


GGTTCCAGAT 


GCGACATCCA 


GATGAC CCAG 


TCTCCATCTT 


CCGTGTCTGC 


AT CTGTAGGA 


120 


GACAGAGTCA 


CCATCACTTG 


TCGGGCGAGT 


CAGGATATTA 


GCAGCTGGTT 


AGCCTGGTAT 


180 


CAGCATAAAC 


CAGGTAAAGC 


ACCTAAGCTC 


CTGATCTATG 


CTGCATCCAG 


TTTGCAAAGT 


240 


GGTGTCCCAT 


CAAGGTTCAG 


CGGAAGTGGA 


TCTGGGACAG 


ATTTCACTCT 


CACCATCAGC 


300 


AGCCTGCAGC 


CTGAAGATTT 


TGCAACTTAC 


TATTGTCAAC 


AGGCTAATAG 


TTTCCCGTAC 


360 


ACTTTTGGTC 


AGGGAACCAA 


GCTGGAGATC 


AAACGAACTG 


TGGCTGCACC 


ATCTGTCTTC 
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(2) INFORMATION FOR SEQ ID NO: 221: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221: 
ATGGTCCCAG CTCAGCTCCT CGGTCTCCTG CTGCTCTGGT TCCC 



(2) INFORMATION FOR SEQ ID NO:222: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:222: 
AGGTTC CAGA TGCGACATCC AGATGACCCA GTCTCCATCT TCCG 



(2) INFORMATION FOR SEQ ID NO:223: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 3: 
TGTCTGCATC TGTAGGAGAC AGAGTCACCA TCACTTGTCG GGCG 



(2) INFORMATION FOR SEQ ID NO:224: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224: 
AGTCAGGATA TTAGCAGCTG GTTAGCCTGG TATCAGCATA AACC 



(2) INFORMATION FOR SEQ ID NO: 225: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225: 
AGGTAAAGCA CCTAAGCTCC TGATCTATGC TGCATCCAGT TTGC 



(2) INFORMATION FOR SEQ ID NO: 22 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226: 
AGGAGCTTAG GTGCTTTACC TGGTTTATGC TGATACCAGG CTAA 



(2) INFORMATION FOR SEQ ID NO: 227: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227: 
CCAGCTGCTA ATATCCTGAC TCGCCCGACA AGTGATGGTG ACTC 



(2) INFORMATION FOR SEQ ID NO:228: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228: 
TGTCTCCTAC AGATGCAGAC ACGGAAGATG GAGACTGGGT CATC 



(2) INFORMATION FOR SEQ ID NO: 22 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229: 
TGGATGTCGC ATCTGGAACC TGGGAACCAG AGCAGCAGGA GACC 



(2) INFORMATION FOR SEQ ID NO: 230: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:230: 
GAGGAGCTGA GCTGGGACCA TCATGGTGGC AAGCTTAGAG TC 



(2) INFORMATION FOR SEQ ID NO: 231: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231: 
GACTCTAAGC TTGCCACCAT GATGGTCC 
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(2) INFORMATION FOR SEQ ID NO: 232: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:232: 
ACCTTGATGG GACACCACTT TGCAAACTGG ATGCAGCATA GATC 



(2) INFORMATION FOR SEQ ID NO: 23 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:233: 
AAAGTGGTGT CCCATCAAGG TTCAGCGGAA GTGGATCTGG GACA 



(2) INFORMATION FOR SEQ ID NO: 234: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:234: 
GATTTCACTC TCACCATCAG CAGCCTGCAG CCTGAAGATT TTGC 



(2) INFORMATION FOR SEQ ID NO: 23 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 5: 
AACTTACTAT TGTCAACAGG CTAATAGTTT CCCGTACACT TTTG 
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(2) INFORMATION FOR SEQ ID NO: 236: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236: 
GTCAGGGAAC CAAGCTGGAG ATCAAACGAA CTGTGGCTGC ACCA 



(2) INFORMATION FOR SEQ ID NO: 23 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:237: 
TCTGTCTTCA TCTTCCCGCC ATCTGATGAG CAGTTGA 



(2) INFORMATION FOR SEQ ID NO: 23 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238: 
GGGAAGATGA AGACAGATGG TGCAGCCACA GTTCGTTTGA 



(2) INFORMATION FOR SEQ ID NO: 23 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239: 
TCTCCAGCTT GGTTCCCTGA CCAAAAGTGT ACGGGAAACT ATTA 



44 




350 

(2) INFORMATION FOR SEQ ID NO:240: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 0: 
GCCTGTTGAC AATAGTAAGT TGCAAAATCT TCAGGCTGCA GGCT 44 

(2) INFORMATION FOR SEQ ID NO: 241: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241: 
GCTGATGGTG AGAGTGAAAT CTGTCCCAGA TCCACTTCCG CTGA 44 

(2) INFORMATION FOR SEQ ID NO: 242: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:242: 
TCAACTGCTC ATCAGATGGC 20 

(2) INFORMATION FOR SEQ ID NO: 243: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3819 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 3: 
TCTTCCGCTT CCTCGCTCAC TGACTCGCTG CGCTCGGTCG TTCGGCTGCG GCGAGCGGTA 6 0 

TCAGCTCACT CAAAGGCGGT AATACGGTTA TCCACAGAAT CAGGGGATAA CGCAGGAAAG 12 0 

AACATGTGAG CAAAAGGCCA GCAAAAGGCC AGGAAC CGTA AAAAGGCCGC GTTGCTGGCG 18 0 
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TTTTTCCATA GGCTCCGCCC CCCTGACGAG CATCACAAAA ATCGACGCTC AAGTCAGAGG 24 0 

TGGCGAAACC CGACAGGACT ATAAAGATAC CAGGCGTTTC CCCCTGGAAG CTCCCTCGTG 3 00 

CGCTCTCCTG TTCCGACCCT GCCGCTTACC GGATACCTGT CCGCCTTTCT CCCTTCGGGA 360 

AGCGTGGCGC TTTCTCATAG CTCACGCTGT AGGTATCTCA GTTCGGTGTA GGTCGTTCGC 42 0 

TCCAAGCTGG GCTGTGTGCA CGAACCCCCC GTTCAGCCCG ACCGCTGCGC CTTATCCGGT 480 

AACTATCGTC TTGAGTCCAA CCCGGTAAGA CACGACTTAT CGCCACTGGC AGCAGCCACT 540 

GGTAACAGGA TTAGCAGAGC GAGGTATGTA GGCGGTGCTA CAGAGTTCTT GAAGTGGTGG 60 0 

CCTAACTACG GCTACACTAG AAGGACAGTA TTTGGTATCT GCGCTCTGCT GAAGCCAGTT 66 0 

ACCTTCGGAA AAAGAGTTGG TAGCTCTTGA TCCGGCAAAC AAACCACCGC TGGTAGCGGT 72 0 

GGTTTTTTTG TTTGCAAGCA GCAGATTACG CGCAGAAAAA AAGGATCTCA AGAAGATCCT 78 0 

TTGATCTTTT CTACGGGGTC TGACGCTCAG TGGAACGAAA ACTCACGTTA AGGGATTTTG 84 0 

GTCATGAGAT TATCAAAAAG GATCTTCACC TAGATCCTTT TAAATTAAAA ATGAAGTTTT 90 0 

AAATCAATCT AAAGTATATA TGAGTAAACT TGGTCTGACA GTTACCAATG CTTAAT CAGT 96 0 

GAGGCACCTA TCTCAGCGAT CTGTCTATTT CGTTCATCCA TAGTTGCCTG ACTCCCGGTC 102 0 

GTGTAGATAA CTACGATACG GGAGGGCTTA CCATCTGGCC CCAGTGCTGC AATGATACCG 108 0 

CGAGACCCAC GCTCACCGGC TCCAGATTTA TCAGCAATAA ACCAGCCAGC CGGAAGGGCC 1140 

GAGCGCAGAA GTGGTCCTGC AACTTTATCC GGCTCCATCC AGTCTATTAA TTGTTGCCGG 12 0 0 

GAAGCTAGAG TAAGTAGTTC GCCAGTTAAT AGTTTGCGCA ACGTTGTTGC CATTGCTACA 12 6 0 

GGCATCGTGG TGTCACGCTC GTGGTTTGGT ATGGCTTCAT TCAGCTCCGG TTCCCAACGA 132 0 

TCAAGGCGAG TTACATGATC CCCCATGTTG TGCAAAAAAG CGGTTAGCTC CTTCGGTCCT 138 0 

CCGATCGTTG TCAGAAGTAA GTTGGCCGCA GTGTTATCAC TCATGGTTAT GGCAGGACTG 1440 

CATAATTCTC TTACTGTCAT GCCATCCGTA AGATGCTTTT CTGTGACTGG TGAGTACTCA 150 0 

ACCAAGTCAT TCTGAGAATA GTGTATGCGG CGACCGAGTT GCTCTTGCCC GGCGT CAATA 156 0 

CGGGATAATA CCGCGGCAGA TAGCAGAACT TTAAAAGTGC TCATCATTGG AAAACGTTCT 162 0 

TCGGGGCGAA AACTCTCAAG GATCTTACCG CTGTTGAGAT CCAGTTCGAT GTAACCCACT 16 8 0 

CGTGCACCCA ACTGATCTTC AGCATCTTTT ACTTTCACCA GCGTTTCTGG GTGAGCAAAA 174 0 

ACAGGAAGGC AAAATGCCGC AAAAAAGGGA ATAAGGGCGA CACGGAAATG TTGAATACTC 180 0 

ATACTCTTCC TTTTTCAATA TTATTGAAGC ATTTATCAGG GTTATTGTCT CATGAGCGGA 186 0 

TACATATTTG AATGTATTTA GAAAAATAAA CAAATAGGGG TTCCGCGCAC ATTTCCCCGA 192 0 

AAAGTGC C AC CTGACGTCTA AGAAACCATT ATTAT CATGA CATTAACCTA TAAAAATAGG 1980 

CGTATCACGA GGCCCTTTCG TCTCGCGCGT TTCGGTGATG ACGGTGAAAA CCTCTGACAC 2 040 

ATGCAGCTCC CGGAGACGGT CACAGCTTGT CTGTAAGCGG ATGCCGGGAG CAGACAAGCC 210 0 

CGTCAGGGCG CGTCAGCGGG TGTTGGCGGG TGTCGGGGCT GGCTTAACTA TGCGGGATCA 216 0 

GAGCAGATTG TACTGAGAGT GCACCATATG CGGTGTGAAA TACCGCACAG ATGCGTAAGG 222 0 




352 



AGAAAATACC 


GCATCAGGCG 


CCATTCGCCA 


TTCAGGCTGC 


GCAACTGTTG 


GGAAGGGCGA 


2280 


TCGGTGCGGG 


CCTCTTCGCT 


ATTACGCCAG 


CTGGCGAAAG 


GGGGATGTGC 


TGCAAGGCGA 


2340 


TTAAGTTGGG 


TAACGCCAGG 


GTTTTCCCAG 


TCACGACGTT 


GTAAAACGAC 


GGCCAGTGCC 


2400 


AAGCTAGCGG 


CCGCGGTCCA 


ACCACCAATC 


TCAAAGCTTG 


CCACGATGAT 


GGTCCCAGCT 


2460 


CAGCTCCTCG 


GTCTCCTGGT 


GCTCTGGTTC 


CCAGGTTCCA 


GATGCGACAT 


CCAGATGACC 


2520 


CAGTCTCCAT 


CTTCCGTGTC 


TGCATCTGTA 


GGAGACAGAG 


TCACCATCAC 


TTGTCGGGCG 


2580 


AGTCAGGATA 


TTAGCAGCTG 


GTTAGCCTGG 


TATCAGCATA 


AACCAGGTAA 


AGCAC CTAAG 


2640 


CTCCTGATCT 


ATGCTGCATC 


CAGTTTGCAA 


AGTGGTGTCC 


CATCAAGGTT 


CAGCGGAAGT 


2700 


GGATCTGGGA 


CAGATTTCAC 


TCTCACCATC 


AGCAGCCTGC 


AGCCTGAAGA 


TTTTGCAACT 


2760 


TACTATTGTC 


AACAGGCTAA 


TAGTTTCCCG 


TACACTTTTG 


GTCAGGGAAC 


CAAGCTGGAG 


2820 


AT CAAACGAA 


CTGTGGCTGC 


ACCATCTGTC 


TTCATCTTCC 


CGCCATCTGA 


TGAGCAGTTG 


2880 


AAATCTGGAA 


CTGCCTCTGT 


TGTGTGCCTG 


CTGAATAACT 


TCTATCCCAG 


AGAGGCCAAA 


2940 


GTACAGTGGA 


AGGTGGATAA 


CGCCCTCCAA 


TCGGGTAACT 


C C C AGG AG AG 


TGTCACAGAG 


3000 


CAGGACAGCA 


AGGACAGCAC 


CTACAGCCTC 


AGCAGCACCC 


TGACGCTGAG 


CAAAGCAGAC 


3060 


TACGAGAAAC 


ACAAAGTCTA 


CGCCTGCGAA 


GTCACCCATC 


AGGGCCTGAG 


CTCGCCCGTC 


3120 


ACAAAGAGCT 


TCAACAGGGG 


AGAGTGTTAG 


AGGGAGAAGT 


GCCCCCACCT 


GCTCCTCAGT 


3180 


TCCAGCCTGA 


CCCCCTCCCA 


TCGTTTGGCC 


TCTGACCCTT 


TTTCCACAGG 


GGACCTACCC 


3240 


CTATTGCGGT 


CCTCCAGCTC 


ATCTTTCACC 


TCACCCCCCT 


CCTCCTCCTT 


GGCTTTAATT 


3300 


ATGCTAATGT 


TGGAGGAGAA 


TGAATAAATA 


AAGTGAATCT 


TTGCACCTGT 


GGTTTCTCTC 


3360 


TTTCCTCAAT 


TTAATAATTA 


TTATCTGTTG 


TTTACCAACT 


ACTCAATTTC 


TCTTATAAGG 


3420 


GACTAAATAT 


GTAGTCATCC 


TAAGGCGCAT 


AACCATTTAT 


AAAAATCATC 


CTTCATTGTA 


3480 


TTTTACCCTA 


TCATCCTCTG 


CAAGACAGTC 


CTCCCTCAAA 


CCCACAAGCC 


TTCTGTCCTC 


3540 


ACAGTCCCCT 


GGGCCATGGA 


TCCTCACATC 


CCAATCCGCG 


GCCGCAATTC 


GTAATCATGG 


3600 


TCATAGCTGT 


TTCCTGTGTG 


AAATTGTTAT 


CCGCTCACAA 


TTCCACACAA 


CATACGAGCC 


3660 


GGAAGCATAA 


AGTGTAAAGC 


CTGGGGTGCC 


TAATGAGTGA 


GCTAACTCAC 


ATTAATTGCG 


3720 


TTGCGCTCAC 


TGCCCGCTTT 


CCAGTGGGGA 


AACCTGTCGT 


GCCAGCTGCA 


TTAATGAATC 


3780 


GGCCAACGCG 


CGGGGAGAGG 


CGGTTTGCGT 


ATTGGGCGC 
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(2) INFORMATION FOR SEQ ID NO:244: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
( D } TOPOLOGY : 1 inear 



(ii) MOLECULE TYPE: 



DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244: 
TTCTTCCTCC TCCTGGTGGC AGCTCCTAGA TGGGTCCTGT CTC 



(2) INFORMATION FOR SEQ ID NO: 245: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245: 
AGGTGCAGCT ACAGCAGTGG GGCGCAGGAC TGTTGAAGCC TTC 



(2) INFORMATION FOR SEQ ID NO:246: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:246: 
GGAGACCCTG TCCCTCACCT GCGCTGTCTA TGGTGGTTCC TTC 



(2) INFORMATION FOR SEQ ID NO: 247: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 7: 
AGTGGTTACT ACTGGAGCTG GATCCGCCAG CCACCAGGTA AGG 



(2) INFORMATION FOR SEQ ID NO: 24 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248: 
GTCTGGAGTG GATTGGTGAA ATCAATCATA GTGGAAGCAC CAA 
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(2) INFORMATION FOR SEQ ID NO:249: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 43 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 9: 
TTCACCAATC CACTCCAGAC CCTTACCTGG TGGCTGGCGG ATC 



(2) INFORMATION FOR SEQ ID NO:250: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 0: 
CAGCTCCAGT AGTAAC CACT GAAGGAACCA C CAT AG AC AG CGC 



(2) INFORMATION FOR SEQ ID NO: 251: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 3 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251: 
AGGTGAGGGA CAGGGTCTCC GAAGGCTTCA ACAGTCCTGC GCC 



(2) INFORMATION FOR SEQ ID NO: 252: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:252: 
CCACTGCTGT AGCTGCACCT GAGACAGGAC CCATCTAGGA GCT 



(2) INFORMATION FOR SEQ ID NO: 253: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:253: 
GCCACCAGGA GGAGGAAGAA CCACAGGTGT TTCATGGTGG CAAGCTTG 



(2) INFORMATION FOR SEQ ID NO: 2 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 54: 
CATGAAACAC CTGTGGTTCT TCC 



(2) INFORMATION FOR SEQ ID NO:255: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255: 
TCTTGAGAGA CGGGTTGTAG TTGGTGCTTC CACTATGATT GAT 



(2) INFORMATION FOR SEQ ID NO:256: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256: 
CTACAACCCG TCTCTCAAGA GTCGAGTCAC CATATCAGTA GAC 



(2) INFORMATION FOR SEQ ID NO: 257: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 57: 
ACGTCCAAGA ACCAGTTCTC TCTGAAACTG AGCTCTGTGA CCG 



(2) INFORMATION FOR SEQ ID NO:258: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:258: 
CTGCGGACAC GGCTGTGTAT TACTGTGCGA GAGTAATTAA TTG 



(2) INFORMATION FOR SEQ ID NO: 259: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259: 
GTTCGACCCT TGGGGCCAGG GAACCCTGGT CACCGTCTCC TCA 



(2) INFORMATION FOR SEQ ID NO: 26 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260: 
GCCTCAACCA AGGGCCCATC GGTCTTCCCC CTGGCACC 
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(2) INFORMATION FOR SEQ ID NO: 2 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 61: 
CGATGGGCCC TTGGTTGAGG CTGAGGAGAC GGTGACCAGG GTTC 



(2) INFORMATION FOR SEQ ID NO:262: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:262: 
CCTGGCCCCA AGGGTCGAAC CAATTAATTA CTCTCGCACA GTA 



(2) INFORMATION FOR SEQ ID NO:263: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:263: 
ATACACAGCC GTGTCCGCAG CGGT CACAGA GCTCAGTTTC AGA 



(2) INFORMATION FOR SEQ ID NO: 264: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 64: 
GAGAACTGGT TCTTGGACGT GTCTACTGAT ATGGTGACTC GAC 
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(2) INFORMATION FOR SEQ ID NO:265: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265: 
GAAGCACCAA CTACAACCCG 



(2) INFORMATION FOR SEQ ID NO: 266: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266: 
GAGTTCCACG ACACCGTCAC C 



(2) INFORMATION FOR SEQ ID NO: 267: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267: 
GAC CTCAAGC TTGCCACCAT GAAACACCTG TGG 



(2) INFORMATION FOR SEQ ID NO:268: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4926 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:268: 
GAACTCGAGC AGCTGAAGCT TGCCACCATG AAACACCTGT GGTTCTTCCT CCTCCTGGTG 
GCAGCTCCTA GATGGGTCCT GTCTCAGGTG CAGCTACAGC AGTGGGGCGC AGGACTGTTG 
AAGCCTTCGG AGACCCTGTC CCTCACCTGC GCTGTCTATG GTGGTTCCTT CAGTGGTTAC 
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TACTGGAGCT GGATCCGCCA GCCACCAGGT AAGGGTCTGG AGTGGATTGG TGAAATCAAT 240 

CATAGTGGAA GCACCAACTA CAACCCGTCT CTCAAGAGTC GAGTCACCAT ATCAGTAGAC 3 00 

ACGT CCAAGA ACCAGTTCTC TCTGAAACTG AGCTCTGTGA CCGCTGCGGA CACGGCTGTG 36 0 

TATTACTGTG CGAGAGTAAT TAATTGGTTC GACCCTTGGG GCCAGGGAAC CCTGGTCACC 42 0 

GTCTCCTCAG CCTCAACCAA GGGCCCATCG GTCTTCCCCC TGGCACCCTC CTCCAAGAGC 48 0 

ACCTCTGGGG GCACAGCGGC CCTGGGCTGC CTGGTCAAGG ACTACTTCCC CGAACCGGTG 54 0 

ACGGTGTCGT GGAACTCAGG CGCCCTGACC AGCGGCGTGC ACACCTTCCC GGCTGTCCTA 60 0 

CAGTCCTCAG GACTCTACTC CCTCAGCAGC GTGGTGACCG TGCCCTCCAG CAGCTTGGGC 66 0 

ACCCAGACCT ACATCTGCAA CGTGAATCAC AAGCCCAGCA AC AC C AAGGT GGACAAGAAA 72 0 

GTTGGTGAGA GGCCAGCACA GGGAGGGAGG GTGTCTGCTG GAAGCCAGGC TCAGCGCTCC 78 0 

TGCCTGGACG CATCCCGGCT ATGCAGCCCC AGTCCAGGGC AGCAAGGCAG GCCCCGTCTG 84 0 

CCTCTTCACC CGGAGGCCTC TGCCCGCCCC ACTCATGCTC AGGGAGAGGG TCTTCTGGCT 9 00 

TTTTCCCCAG GCTCTGGGCA GGCACAGGCT AGGTGCCCCT AACCCAGGCC CTGCACACAA 96 0 

AGGGGCAGGT GCTGGGCTCA GACCTGCCAA GAGCCATATC CGGGAGGACC CTGCCCCTGA 102 0 

CCTAAGCCCA CCCCAAAGGC CAAACTCTCC ACTCCCTCAG CTCGGACACC TTCTCTCCTC 1080 

CCAGATTCCA GTAACTCCCA ATCTTCTCTC TGCAGAGCCC AAATCTTGTG ACAAAACTCA 114 0 

CACATGCCCA CCGTGCCCAG GTAAGC CAGC CCAGGCCTCG CCCTCCAGCT CAAGGCGGGA 12 0 0 

CAGGTGCCCT AGAGTAGCCT GCATCCAGGG ACAGGCCCCA GCCGGGTGCT GACACGTCCA 12 6 0 

CCTCCATCTC TTCCTCAGCA CCTGAACTCC TGGGGGGACC GTCAGTCTTC CTCTTCCCCC 132 0 

CAAAACCCAA GGACACCCTC ATGATCTCCC GGACCCCTGA GGTCACATGC GTGGTGGTGG 13 8 0 

ACGTGAGCCA CGAAGACCCT GAGGTCAAGT TCAACTGGTA CGTGGACGGC GTGGAGGTGC 144 0 

ATAATGCCAA GACAAAGCCG CGGGAGGAGC AGTACAACAG CACGTACCGT GTGGTCAGCG 15 0 0 

TCCTCACCGT CCTGCACCAG GACTGGCTGA ATGGCAAGGA GTACAAGTGC AAGGTCTCCA 156 0 

ACAAAGCCCT CCCAGCCCCC ATCGAGAAAA CCATCTCCAA AGC CAAAGGT GGGACCCGTG 162 0 

GGGTGCGAGG GCCACATGGA CAGAGGCCGG CTCGGCCCAC CCTCTGCCCT GAGAGTGACC 16 80 

GCTGTACCAA CCTCTGTCCC TACAGGGCAG CCCCGAGAAC CACAGGTGTA CACCCTGCCC 174 0 

CCATCCCGGG ATGAGCTGAC CAAGAACCAG GTCAGCCTGA CCTGCCTGGT CAAAGGCTTC 18 0 0 

TATCCCAGCG ACATCGC CGT GGAGTGGGAG AGCAATGGGC AGC CGGAGAA CAACTACAAG 186 0 

ACCACGCCTC CCGTGCTGGA CTCCGACGGC TCCTTCTTCC TCTACAGCAA GCTCACCGTG 192 0 

GACAAGAGCA GGTGG CAGC A GGGGAACGTC TTCTCATGCT CCGTGATGCA TGAGGCTCTG 198 0 

CACAACCACT ACACGCAGAA GAGCCTCTCC CTGTCTCCGG GTAAATGAGT GCGACGGCCG 2 04 0 

GCAAGCCCCC GCTCCCCGGG CTCTCGCGGT CGCACGAGGA TGCTTGGCAC GTACCCCCTG 2100 

TACATACTTC CCGGGCGCCC AGCATGGAAA TAAAGCACCC AGCGCTGCCC TGGGCCCCTG 216 0 

CGAGACTGTG ATGGTTCTTT CCACGGGTCA GGCCGAGTCT GAGGC CTGAG TGGCATGAGG 222 0 
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GAGGCAGAGC 


GGGTCCCACT 


GTCCCCACAC 


TGGCCCAGGC 


TGTGCAGGTG 


TGCCTGGGCC 


2280 


CCCTAGGGTG 


GGGCTCAGCC 


AGGGGCTGCC 


CTCGGCAGGG 


TGGGGGATTT 


GCCAGCGTGG 


2340 


CCCTCCCTCC 


AGCAGCACCT 


GCCCTGGGCT 


GGGCCACGGG 


AAGCCCTAGG 


AGCCCCTGGG 


2400 


GACAGACACA 


CAGCCCCTGC 


CTCTGTAGGA 


GACTGTCCTG 


TTCTGTGAGC 


GCCCCTGTCC 


2460 


TCCCGACCTC 


CATGCCCACT 


CGGGGGCATG 


CCTGGAGGTC 


GACTCTAGAG 


GATCCCCGGG 


2520 


TACCGAGCTC 


GAATTCATGG 


ATGATATCAG 


ATCTGCCGGT 


CTCCCTATAG 


TGAGT CGT AT 


2580 


TAATTTCGAT 


AAGCCAGGTT 


AACCTGCATT 


AATGAATCGG 


CCAACGGGCG 


GGGAGAGGCG 


2640 


GTTTGCGTAT 


TGGGCGCTCT 


TCCGCTTCCT 


CGCTCACTGA 


CTCGCTGCGC 


TCGGTCGTTC 


2700 


GGCTGCGGCG 


AGCGGT AT C A 


GCTCACTCAA 


AGGCGGTAAT 


ACGGTTATCC 


ACAGAATCAG 


2760 


GGGATAACGC 


AGGAAAGAAC 


ATGTGAGCAA 


AAGGCCAGCA 


AAAGGCCAGG 


AAC CGTAAAA 


2820 


AGGCCGCGTT 


GCTGGCGTTT 


TTCCATAGGC 


TCCGCCCCCC 


TGACGAGCAT 


CACAAAAATC 


2880 


GACGCTCAAG 


TCAGAGGTGG 


CGAAACCCGA 


CAGGACTATA 


AAG AT AC C AG 


GCGTTTCCCC 


2940 


CTGGAAGCTC 


CCTCGTGCGC 


TCTCCTGTTC 


CGACCCTGCC 


GCTTACCGGA 


TACCTGTCCG 


3000 


CCTTTCTCCC 


TTCGGGAAGC 


GTGGCGCTTT 


GTCAATGCTC 


ACGCTGTAGG 


TATCTCAGTT 


3060 


CGGTGTAGGT 


CGTTCGCTCC 


AAGCTGGGCT 


GTGTGCACGA 


ACCCCCCGTT 


CAGCCCGACC 


3120 


GCTGCGCCTT 


ATCCGGTAAC 


TATCGTCTTG 


AGTCCAACCC 


GGTAAGACAC 


GACTTATCGC 


3180 


CACTGGCAGC 


AGCCACTGGT 


AACAGGATTA 


GCAGAGCGAG 


GTATGTAGGC 


GGTGCTACAG 


3240 


AGTTCTTGAA 


GTGGTGGCCT 


AACTACGGCT 


ACACTAGAAG 


GACAGTATTT 


GGTATCTGCG 


3300 


CTCTGCTGAA 


GCCAGTTACC 


TTCGGAAAAA 


GAGTTGGTAG 


CTCTTGATCC 


GGCAAACAAA 


3360 


CCACCGCTGG 


TAGCGGTGGT 


TTTTTTGTTT 


GCAAGCAGCA 


GATTACGCGC 


AGAAAAAAAG 


3420 


GATCTCAAGA 


AGATCCTTTG 


ATCTTTTCTA 


CGGGGTCTGA 


CGCTCAGTGG 


AACGAAAA.CT 


3480 


CACGTTAAGG 


GATTTTGGTC 


ATGAGATTAT 


CAAAAAGGAT 


CTTCACCTAG 


ATCCTTTTAA 


3540 


ATTAAAAATG 


AAGTTTTAAA 


TCAATCTAAA 


GTATATATGA 


GTAAACTTGG 


TCTGACAGTT 


3600 


ACCAATGCTT 


AATCAGTGAG 


GCACCTATCT 


CAGCGATCTG 


TCTATTTCGT 


TCATCCATAG 


3660 


TTGCCTGACT 


CCCCGTCGTG 


TAGATAACTA 


CGATACGGGA 


GGGCTTACCA 


TCTGGCCCCA 


3720 


GTGCTGCAAT 


GATAC CGCGA 


GACCCACGCT 


CACCGGCTCC 


AGATTTATCA 


GCAATAAACC 


3780 


AGCCAGCCGG 


AAGGGCCGAG 


CGCAGAAGTG 


GTCCTGCAAC 


TTTATCCGCC 


TCCATCCAGT 


3840 


CTATTAATTG 


TTGCCGGGAA 


GCTAGAGTAA 


GTAGTTCGCC 


AGTTAATAGT 


TTGCGCAACG 


3900 


TTGTTGCCAT 


TGCTACAGGC 


ATCGTGGTGT 


CACGCTCGTC 


GTTTGGTATG 


GCTTCATTCA 


3960 


GCTCCGGTTC 


CCAACGATCA 


AGGCGAGTTA 


CATGATCCCC 


CATGTTGTGC 


AAAAAAGCGG 


4020 


TTAGCTCCTT 


CGGTCCTCCG 


ATCGTTGTCA 


GAAGTAAGTT 


GGCCGCAGTG 


TTATCACTCA 


4080 


TGGTTATGGC 


AGCACTGCAT 


AATTCTCTTA 


CTGTCATGCC 


ATC CGTAAGA 


. TGCTTTTCTG 


4140 


TGACTGGTGA 


. GTACTCAACC 


AAGTCATTCT 


GAGAATAGTG 


TATGCGGCGA 


. CCGAGTTGCT 


4200 


CTTGCCCGGC 


1 GTCAATACGG 


GATAATACCG 


CGCCACATAG 


[ CAGAACTTTA AAAGTGCTCA 


4260 
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TCATTGGAAA 


ACGTTCTTCG 


GGGCGAAAAC 


TCTCAAGGAT 


CTTACCGCTG 


TTGAGATCCA 


4320 


GTTCGATGTA 


ACCCACTCGT 


GCACCCAACT 


GATCTTCAGC 


ATCTTTTACT 


TTCACCAGCG 


4380 


TTTCTGGGTG 


AGCAAAAACA 


GGAAGGCAAA 


ATGCCGCAAA 


AAAGGGAATA 


AGGGCGACAC 


4440 


GGAAATGTTG 


AAT AC T CAT A 


CTCTTCCTTT 


TTCAATATTA 


TTGAAGCATT 


TATCAGGGTT 


4500 


ATTGTCTCAT 


GAGCGGATAC 


ATATTTGAAT 


GTATTTAGAA 


AAATAAACAA 


ATAGGGGTTC 


4560 


CGCGCACATT 


TCCCCGAAAA 


GTGCCACCTG 


ACGTCTAAGA 


AACCATTATT 


ATCATGACAT 


4620 


TAACCTATAA 


AAATAGGCGT 


ATCACGAGGC 


CCTTTCGTCT 


CGCGCGTTTC 


GGTGATGACG 


4680 


GTGAAAACCT 


CTGACACATG 


CAGCTCCCGG 


AGACGGTCAC 


AGCTTGTCTG 


TAAGCGGATG 


4740 


CCGGGAGCAG 


ACAAGCCCGT 


CAGGGCGCGT 


CAGCGGGTGT 


TGGCGGGTGT 


CGGGGCTGGC 


4800 


TTAACTATGC 


GGCATCAGAG 


CAGATTGTAC 


TGAGAGTGCA 


C C AT ATGGAC 


ATATTGTCGT 


4860 


TAGAACGCGG 


CTACAATTAA 


TACATAACCT 


TATGTATCAT 


ACACATACGA 


TTTAGGTGAC 


4920 


ACTATA 
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(2) INFORMATION FOR SEQ ID NO: 269: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 246 base pairs 

(B) TYPE: nucleic acid 

(C) S TRAHDEDNE S S : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269: 

TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTA CCAGCTACTG GATCGGCTGG 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 

GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA C AAGTC CAT C 

AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 
GCGAGA 

(2) INFORMATION FOR SEQ ID NO: 270: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:270: 
TACTGGTACT TCGATCTCTG GGGCCGTGGC ACCCTGGTCA CTGTCTCCTC AG 



52 
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(2) INFORMATION FOR SEQ ID NO: 2 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 07 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 271: 

TCTCTGAAGA TCTCCTGTAA GGGCTCTGGA TACAGCTTTA CCAGCTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTCCGAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 18 0 

AGCACCGCCT ACCTCGAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 240 

GCGAGACAGG GGGGGGATAG GTACTTCGAT CTCTGGGGCC GTGGCACCCT GGTCACTGTC 30 0 

TCCTCAG 307 

(2) INFORMATION FOR SEQ ID NO: 2 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:272: 
GCTTTTGATA TCTGGGGCCA AGGGACAATG GTCACCGTCT CTTCAG 4 6 

(2) INFORMATION FOR SEQ ID NO: 273: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 310 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:273: 

TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTA CCAGCTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 18 0 

AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 24 0 

GCGAGACATT GGCTAAATGG GGATGCTTTT GATATCTGGG GCCAAGGGAC AATGGTCACC 3 00 

GTCTCTTCAG 310 
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(2) INFORMATION FOR SEQ ID NO: 274: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 325 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 74: 

TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTA CCAGCTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGC CTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 18 0 

AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AGGGCCTCGG ACAGTGTCAT GTATTACTGT 24 0 

GCGAGACGGG ATTACGATAT TTTGACTGGT TATTATGCGG CTTTTGATAT CTGGGGCCAA 3 00 

GGGACAATGG TCACCGTCTC TTCAG 32 5 

(2) INFORMATION FOR SEQ ID NO: 2 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

( ii ) MOLECULE TYPE : DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 75: 
TACTTTGATA TCTGGGGCCA AGGGACAATG GTCACCGTCT CTTCAG 46 

(2) INFORMATION FOR SEQ ID NO: 2 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 313 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:276: 

TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTC CCATCTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCTCCA TCTCAGTCGA CAAGTCCATC 18 0 

AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 240 

GCGAGAGTGG TTCGGGGATT TATTATTTAC TTTGATATCT GGGGCCAAGG GACAATGGTC 30 0 

ACCGTCTCTT CAG 313 
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(2) INFORMATION FOR SEQ ID NO: 277: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 330 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277: 



TCTCTGAAGA 


TCTCCTGTAA 


GGGTTCTGGA 


TACAGTTTTT 


CCGACTACTG 


GATCGGCTGG 


60 


GTGCGCCAGA 


TGCCCGGGAA 


AGGCCTGGAG 


TGGATGGGGA 


TCATCTATCC 


TGGTGACTCT 


120 


GACACCAGAT 


ACAGCCCGTC 


CTTCCAGGGC 


CAGGTCACCA 


TCTCAGCCGA 


CAAGTCCATC 


180 


AACACCGCCT 


TCCTGCAGTG 


GAACACCCTG 


GAGGCTTCGG 


ACACCGCCAT 


GTATTACTGT 


240 


GCGAGAGGGT 


ATTATTATGA 


TTCGGGGACT 


TATTATAAGT 


CTACCCCTTT 


GATATCTGGG 


300 


GCCAAGGGAC 


AATGGTCACC 


GTCTCTTCAG 
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(2) INFORMATION FOR SEQ ID NO: 278: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 01 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO:278: 



TCTCTGAAGA 


TCTCCTGTAA 


GGGTTCTGGA 


TACAGCTTTA 


CCAGCTACTG 


GATCGGCTGG 


60 


GTGCGCCAGA 


TGCCCGGGAA 


AGGCCTGGAG 


TGGATGGGGA 


TCATCTATCC 


TGGTGACTCT 


120 


GAT AC C AG AT 


ACAGCCCGTC 


CTTCCAAGGC 


CAGGTCACCA 


TCTCAGCCGA 


CAAGTCCATC 


180 


AGCACCGCCT 


ACCTGCAGTG 


GAGCAGCCTG 


AAGGCCTCGG 


ACACCGCCAT 


GTATTACTGT 


240 


GCGAGACTAA 


CTGGCCTCTT 


TAATATCTGG 


GGC CAAGGGA 


CAATGGT C AC 


CGTCTCTTCA 


300 


G 
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(2) INFORMATION FOR SEQ ID NO: 27 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 98 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:279: 
TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTA CCAGCTACTG GATCGGCTGG 
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GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 
GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 
AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 
GCGAGACATC TTTACTTTGA TATCTGGGGC CAAGGGACAA AGGTCACCGT CTCTTCAG 

(2) INFORMATION FOR SEQ ID NO: 28 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 98 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:280: 
TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGTTTTA GCAGCTACTG GATCGGCTGG 
GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 
GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 
AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 
GCGAGACATC TTTACTTTGA TATCTGGGGC CAAGGGACAA AGGTCACCGT CTCTTCAG 

(2) INFORMATION FOR SEQ ID NO: 2 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 98 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 81: 
TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTA CCAGCTACTG GATCGGCTGG 
GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 
GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 
AGTACCGCCT ACCTGCAGTG GAGCAGCCTG AGGGCCTCGG ACACCGCCAT TTATTACTGT 
GCGAGACATC TTTACTTTGA TATCTGGGGC CAAGGGACAA AGGTCACCGT CTCTTCAG 

(2) INFORMATION FOR SEQ ID NO: 2 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 298 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 



120 
180 
240 
298 



60 
120 
180 
240 
298 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:282: 

TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTA CCAGCTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 18 0 

AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATCGCTGT 24 0 

GCGAGACATC TTTACTTTGA TATCTGGGGC CAAGGGACAA AGGTCAC CGT CTCTTCAG 2 98 

(2) INFORMATION FOR SEQ ID NO: 283: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 93 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:283: 

TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTA CCAGCCAATG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTGGCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 18 0 

AGTACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 240 

GCGAGACAAG GTTGATATCT GGGGCCAAGG GACAATGGTC ACCGTCTCTT CAG 2 93 

(2) INFORMATION FOR SEQ ID NO: 2 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:284: 

TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTA CCAACTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 18 0 

AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 24 0 

GCGAGACAAA CTTTGATATC TGGGGCCAAG GGACAATGGT CACCGTCTCT TCAG 2 94 
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(2) INFORMATION FOR SEQ ID NO: 2 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 318 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 5: 



TCTCTGAAGA 


TCTCCTGTAA 


GGGTTCTGGA 


TACAGCTTTA 


CCAGCTACTG 


GATCGGCTGG 


60 


GTGCGCCAGA 


TGCCCGGGAA 


AGGCCTGGAG 


TGGATGGGGA 


TCATCTATCC 


TGGTGACTCT 


120 


GAT AC C AG AT 


ACAGCCCGTC 


CTTCCAAGGC 


CAGGTCACCA 


TCTCAGCCGA 


CAAGTCCATC 


180 


AGGACCGCCT 


ACCTGCAGTG 


GAGCAGCCTG 


AAGGCCTCGG 


ACACCGCCAT 


GTATTACTGT 


240 


GCGAGACATG 


GTATAGCAGC 


AGCTGGTACG 


TGGTTCCGAC 


CCCATGGGGC 


CAAGGGACAA 


300 


TGGTCACCGT 


CTCTTCAG 
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(2) INFORMATION FOR SEQ ID NO:286: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 308 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:286: 



TCTCTGAAGA 


TCTCCTGTAA 


GGGTTCTGGA 


TACAGCTTTA 


CCAGCTACTG 


GATCGGCTGG 


60 


GTGCGCCAGA 


TGCCCGGGAA 


AGGCCTGGAG 


TGGATGGGGA 


TCATCTATCC 


TGGTGACTCT 


120 


GATACCAGAT 


ACAGCCCGTC 


CTTCCAAGGC 


CAGGTCACCA 


TCTCAGCCGA 


CAAGTCCATC 


180 


AGCACCGCCT 


ACCTGCAGTG 


GAGCAGCCTG 


AAGGCCTCGG 


ACACCGCCAT 


GTATTACTGT 


240 


GCGGCCGGTA 


TACCAGCAGC 


TGGTTTTTGA 


TATCTGGGGC 


CAAGGGACAA 


TGGTCACCGT 


300 


CTCTTCAG 
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(2) INFORMATION FOR SEQ ID NO:287: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 298 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 ine ar 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287: 
TCTCTGAAGA TCTCCTGTAG GGGTTCTGGA TACAGCTTTT CCAGTTACTG GATCGCCTGG 60 
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GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GAAACCAGAT ACAGTCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 18 0 

AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCGTCGG ACACCGCCAT GTATTACTGT 240 

GCGAGACAGG GCTACTTTGA TATCTGGGGC CAAGGGACAA TGGTCACCGT CTCTTCAG 2 98 

(2) INFORMATION FOR SEQ ID NO: 288: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 00 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:288: 

TCTCTGAAGA TCTCCTGTAA GGTTTCTGGA TACAGCTTAA CCAGTTATTG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 18 0 

AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 24 0 

GCGAGACAAA GGGGTACTTT GATATCTGGG GCCAAGGGAC AATGGTCACC GTCTCTTCAG 3 00 

(2) INFORMATION FOR SEQ ID NO: 289: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 01 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289: 
TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTA CCAGCTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 18 0 

AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 24 0 

GCGAGGGGAT CGTGGTACTT TGATATC TGG GGCCAAGGGA CAATGGTCAC CGTCTCTTCA 300 
G 

(2) INFORMATION FOR SEQ ID NO: 29 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:290: 
AACTGGTTCG ACCCCTGGGG CCAGGGAACC CTGGTCACCG TCTCCTCAG 4 9 

(2) INFORMATION FOR SEQ ID NO: 291: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 07 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 91: 

TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAACTTTA CCACCTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCGTC 18 0 

AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 24 0 

GCGAGACTCC C C AATGAC AG TTGGTTCGAC CCCTGGGGCC AGGGAACCCT GGTCACCGTC 3 00 
TCCTCAG 
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(2) INFORMATION FOR SEQ ID NO: 2 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:292: 

TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTA CCAGCTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 18 0 

AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 24 0 

GCGAGACGGG GGTACTATGG TTCGGGGAGT TATTATAACT GGTTCGACCC CTGGGGCCAG 3 00 

GGAACCCTGG TCACCGTCTC CTCAG 325 

(2) INFORMATION FOR SEQ ID NO: 2 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 
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(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:293: 
TACTACTACT ACTACGGTAT GGACGTCTGG GGGCAAGGGA CCACGGTCAC CGTCTCCTCA 6 0 

G 61 

(2) INFORMATION FOR SEQ ID NO: 2 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 294: 

TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAACTTTA TCACCTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TACCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTC CAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 18 0 

AGCACCGCCT AC CTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 24 0 

GCGAGACATG AGCAGCTGGT ACAGGGTTAC TACTACTACG GTATGGACGT CTGGGGGCAA 3 00 

GGGACCACGG TCACCGTCTC CTCAG 32 5 

(2) INFORMATION FOR SEQ ID NO: 2 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 323 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295: 

TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACACCTTTA CCAGTTACTG GATCGCCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTC CAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 18 0 

AGCACCGCCT AC CTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 24 0 

GCGAGAGATA TGGGGGGGGC CTCACTACTT CTACTTCGGT ATGGACGTCT GGGGGCAAGG 3 00 

GACCACGGTC ACCGTCTCCT CAG 323 
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(2) INFORMATION FOR SEQ ID NO: 296: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 307 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:296: 

TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTG CCAACTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTTTCC TGGTGACTCT 12 0 

GAT AC C AGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAACTCCATC 18 0 

AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 24 0 

GCGAGACACC ACGACTACTA CGGTATGGAC GTCTGGGGGC AAGGGAC C AC GGTCACCGTC 3 00 
TCCTCAG 
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(2) INFORMATION FOR SEQ ID NO: 297: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 301 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297: 

TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTA CCAGCTACTG GATCGGCTGG 60 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTTTCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGCC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 18 0 

AACACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 24 0 

GCGAGACGCT ACTACGGTAT GGACGTCTGG GGGCAAGGGA CCACGGTCAC CGTCTCCTCA 30 0 
G 

(2) INFORMATION FOR SEQ ID NO: 298: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: j oin { 199 . . 247 , 419.. 714) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:298: 

TTTTCTGGCC TGACAACCAG GGTGGCGCAG GATGCTCAGT GCAGAGAGGA AGAAGCAGGT 6 0 

GGTCTCTGCA GCTGGAAGCT CAGCTCCCAC CCAGCTGCTT TGCATGTCCC TCCCAGCTGC 120 

CCTACCTTCC AGAGCC CATA TCAATGCCTG TGTCAGAGCC CTGGGGAGGA ACTGCTCAGT 18 0 

TAGGAC C C AG AGGGAACC ATG GAA GCC CCA GCT CAG CTT CTC TTC CTC CTG 231 

Met Glu Ala Pro Ala Gin Leu Leu Phe Leu Leu 
15 10 

CTA CTC TGG CTC CCA G GTGAGGGGGA AC C ATGAGGT GGTTTTGCAC 2 77 

Leu Leu Trp Leu Pro 
15 

ATTAGTGAAA ACTCTTGCCA CCTCTGCTCA GCAAGAAATA TAATTAAAAT TCAAAGTATA 33 7 

TCAACAATTT TGGCTCTACT CAAAGACAGT TGGTTTGATC TTGATTACAT GAGTGCATTT 3 97 

CTGTTTTATT TCCAATTTCA G AT ACC ACC GGA GAA ATT GTG TTG ACA CAG 447 

Asp Thr Thr Gly Glu lie Val Leu Thr Gin 
20 25 

TCT CCA GCC ACC CTG TCT TTG TCT CCA GGG GAA AGA GCC ACC CTC TCC 49 5 

Ser Pro Ala Thr Leu Ser Leu Ser Pro Gly Glu Arg Ala Thr Leu Ser 
30 35 40 

TGC AGG GCC AGT CAG AGT GTT AGC AGC TAC TTA GCC TGG TAC CAA CAG 543 
Cys Arg Ala Ser Gin Ser Val Ser Ser Tyr Leu Ala Trp Tyr Gin Gin 
45 50 55 

AAA CCT GGC CAG GCT CCC AGG CTC CTC ATC TAT GAT GCA TCC AAC AGG 5 91 

Lys Pro Gly Gin Ala Pro Arg Leu Leu lie Tyr Asp Ala Ser Asn Arg 
60 65 70 

GCC ACT GGC ATC CCA GCC AGG TTC AGT GGC AGT GGG TCT GGG ACA GAC 63 9 

Ala Thr Gly lie Pro Ala Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp 
75 80 85 90 

TTC ACT CTC ACC ATC AGC AGC CTA GAG CCT GAA GAT TTT GCA GTT TAT 687 
Phe Thr Leu Thr lie Ser Ser Leu Glu Pro Glu Asp Phe Ala Val Tyr 
95 100 105 

TAC TGT CAG CAG CGT AGC AAC TGG CCT CCCACAGTGA TTCCACATGA 734 
Tyr Cys Gin Gin Arg Ser Asn Trp Pro 
110 115 

AACAAAAACC CCAACAAGAC CATCAGTGTT TACTAGATTA TTATAC CAGC TGCTTCCTTT 794 

ACAGACAGCT AGTGGGGT 812 

(2) INFORMATION FOR SEQ ID NO: 2 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299: 

Met Glu Ala Pro Ala Gin Leu Leu Phe Leu Leu Leu Leu Trp Leu Pro 
15 10 15 
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Asp Thr Thr Gly Glu lie Val Leu Thr Gin Ser Pro Ala Thr Leu Ser 
20 25 30 

Leu Ser Pro Gly Glu Arg Ala Thr Leu Ser Cys Arg Ala Ser Gin Ser 
35 40 45 

Val Ser Ser Tyr Leu Ala Trp Tyr Gin Gin Lys Pro Gly Gin Ala Pro 
50 55 60 

Arg Leu Leu lie Tyr Asp Ala Ser Asn Arg Ala Thr Gly lie Pro Ala 
65 70 75 80 

Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr lie Ser 
85 90 95 

Ser Leu Glu Pro Glu Asp Phe Ala Val Tyr Tyr Cys Gin Gin Arg Ser 
100 105 110 

Asn Trp Pro 
115 

(2) INFORMATION FOR SEQ ID NO: 3 00: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: join (18 0. .228, 398. .693) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300: 

AGGGCGGCGC AGATGCTCAG TGCAGAGAGA AGAAACAGGT GGTCTCTGCA GCTGGAAGCT 6 0 

CAGCTCCCAC CCCAGCTGCT TTGCATGTCC CTCCCAGCTG CCCTACCTTC C AGAGC C CAT 12 0 

ATCAATGCCT GGGTCAGAGC TCTGGGGAGG AACTGCTCAG TTAGGAC CCA GACGGAACC 179 

ATG GAA GCC CCA GCG CAG CTT CTC TTC CTC CTG CTA CTC TGG CTC ACA G 228 
Met Glu Ala Pro Ala Gin Leu Leu Phe Leu Leu Leu Leu Trp Leu Thr 
15 10 15 

GTGAGGGGAA TATGAGGTGT CTTTGCACAT CAGTGAAAAC TCCTGCCACC TCTGCTCAGC 288 

AAGAAATATA ATTAAAATTC AAAAT AG AT C AACAATTTTG GCTCTACTCA AAGACAGTGG 348 

GTTTGATTTT GATTACATGA GTGCATTTCT GTTTTATTTC CAATTTCAG AT ACC 4 02 

Asp Thr 

ACC GGA GAA ATT GTG TTG ACA CAG TCT CCA GCC ACC CTG TCT TTG TCT 4 50 

Thr Gly Glu lie Val Leu Thr Gin Ser Pro Ala Thr Leu Ser Leu Ser 
20 25 30 

CCA GGG GAA AGA GCC ACC CTC TCC TGC AGG GCC AGT CAG GGT GTT AGC 498 
Pro Gly Glu Arg Ala Thr Leu Ser Cys Arg Ala Ser Gin Gly Val Ser 
35 40 45 50 
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AGC TAC TTA GCC TGG TAC CAG CAG AAA CCT GGC CAG GCT CCC AGG CTC 546 
Ser Tyr Leu Ala Trp Tyr Gin Gin Lys Pro Gly Gin Ala Pro Arg Leu 
55 60 65 

CTC ATC TAT GAT GCA TCC AAC AGG GCC ACT GGC ATC CCA GCC AGG TTC 594 
Leu lie Tyr Asp Ala Ser Asn Arg Ala Thr Gly lie Pro Ala Arg Phe 
70 75 80 

AGT GGC AGT GGG CCT GGG ACA GAC TTC ACT CTC ACC ATC AGC AGC CTA 642 
Ser Gly Ser Gly Pro Gly Thr Asp Phe Thr Leu Thr lie Ser Ser Leu 
85 90 95 

GAG CCT GAA GAT TTT GCA GTT TAT TAC TGT CAG CAG CGT AGC AAC TGG 6 90 

Glu Pro Glu Asp Phe Ala Val Tyr Tyr Cys Gin Gin Arg Ser Asn Trp 
100 105 110 

CAT CCCACAGTGA TTCCACATGA AACAAAAACC CCAACAAGAC CATCAGTGTT 743 

His 

115 

TACTAGATTA TTATAC CAGC TGCTTCCTTT ACAGACAGCT AGTGGGGTGG CCACTCAGTG 803 

TTAGCATCTC AGCTCTATTT GGC CATTTTG GAGTTCAAGT TGTCAAGTCC AAAATTACTT 86 3 

ATGTTAGTCC ATTGCATCAT ACCATTTCAG TGTGGCT 9 00 

(2) INFORMATION FOR SEQ ID NO: 3 01: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 01: 

Met Glu Ala Pro Ala Gin Leu Leu Phe Leu Leu Leu Leu Trp Leu Thr 
1 5 10 15 

Asp Thr Thr Gly 'Glu lie Val Leu Thr Gin Ser Pro Ala Thr Leu Ser 
20 25 30 

Leu Ser Pro Gly Glu Arg Ala Thr Leu Ser Cys Arg Ala Ser Gin Gly 
35 40 45 

Val Ser Ser Tyr Leu Ala Trp Tyr Gin Gin Lys Pro Gly Gin Ala Pro 
50 55 60 

Arg Leu Leu He Tyr Asp Ala Ser Asn Arg Ala Thr Gly He Pro Ala 
65 70 75 80 

Arg Phe Ser Gly Ser Gly Pro Gly Thr Asp Phe Thr Leu Thr He Ser 
85 90 95 

Ser Leu Glu Pro Glu Asp Phe Ala Val Tyr Tyr Cys Gin Gin Arg Ser 
100 105 HO 



Asn Trp His 
115 
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(2) INFORMATION FOR SEQ ID NO: 302: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: join (116 164 , 352.. 650) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:302: 

CCGCCCCAGC TGCTTTGCAT GTCCCTCCCA GCCGCCCTGC AGTC CAGAGC CCATATCAAT 6 0 

GCCTGGGTCA GAGCTCTGGA GAAGAGCTGC TCAGTTAGGA ACCCCAGAGG GAACC ATG 118 

Met 
1 

GAA ACC CCA GCG CAG CTT CTC TTC CTC CTG CTA CTC TGG CTC CCA G 164 
Glu Thr Pro Ala Gin Leu Leu Phe Leu Leu Leu Leu Trp Leu Pro 
5 10 15 

GTGAGGGGAA CATGGGATGG TTTTGCATGT CAGTGAAAAC CCTCTCAAGT CCTGTTACCT 224 

GGCAACTCTG CTCAGTCAAT ACAATAATTA AAGCTCAATA TAAAGCAATA ATTCTGGCTC 2 84 

TTCTGGGAAG ACAATGGGTT TGATTTAGAT TACATGGGTG ACTTTTCTGT TTTATTTCCA 344 

ATCTCAG AT ACC ACC GGA GAA ATT GTG TTG ACG CAG TCT CCA GGC ACC 3 92 

Asp Thr Thr Gly Glu lie Val Leu Thr Gin Ser Pro Gly Thr 
20 25 30 

CTG TCT TTG TCT CCA GGG GAA AGA GCC ACC CTC TCC TGC AGG GCC AGT 440 
Leu Ser Leu Ser Pro Gly Glu Arg Ala Thr Leu Ser Cys Arg Ala Ser 
35 40 45 

CAG AGT GTT AGC AGC AGC TAC TTA GCC TGG TAC CAG CAG AAA CCT GGC 48 8 

Gin Ser Val Ser Ser Ser Tyr Leu Ala Trp Tyr Gin Gin Lys Pro Gly 
50 55 60 

CAG GCT CCC AGG CTC CTC ATC TAT GGT GCA TCC AGC AGG GCC ACT GGC 53 6 

Gin Ala Pro Arg Leu Leu lie Tyr Gly Ala Ser Ser Arg Ala Thr Gly 
65 70 75 

ATC CCA GAC AGG TTC AGT GGC AGT GGG TCT GGG ACA GAC TTC ACT CTC 584 
lie Pro Asp Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu 
80 85 90 

ACC ATC AGC AGA CTG GAG CCT GAA GAT TTT GCA GTG TAT TAC TGT CAG 632 
Thr He Ser Arg Leu Glu Pro Glu Asp Phe Ala Val Tyr Tyr Cys Gin 
95 100 105 HO 

CAG TAT GGT AGC TCA CCT CCCACAGTGA TTCAGCTTGA AACAAAAACC 680 
Gin Tyr Gly Ser Ser Pro 
115 

TCTGCAAGAC CTTCATTGTT TACTAGATTA TACCAGCTGC TTCCTTTACA GATAGCTGCT 74 0 

GCAATGACAA CTCAATTTAG CATCTCTCTC TGCTTGGGCA TTTTGGGGAT CTTAAAAAAG 8 00 

TAATCCCTTG ATATATTTTT GACTCTGATT CCTGCATTTT TCCTCAGACC AAGATGGACA 8 60 
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GCCAGGTTTA AGCACAGTTT CACAGTAATG GCCACTGGAT 90 0 

(2) INFORMATION FOR SEQ ID NO:303: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:303: 

Met Glu Thr Pro Ala Gin Leu Leu Phe Leu Leu Leu Leu Trp Leu Pro 
15 10 15 

Asp Thr Thr Gly Glu lie Val Leu Thr Gin Ser Pro Gly Thr Leu Ser 
20 25 30 

Leu Ser Pro Gly Glu Arg Ala Thr Leu Ser Cys Arg Ala Ser Gin Ser 
35 40 45 

Val Ser Ser Ser Tyr Leu Ala Trp Tyr Gin Gin Lys Pro Gly Gin Ala 
50 55 60 

Pro Arq Leu Leu He Tyr Gly Ala Ser Ser Arg Ala Thr Gly He Pro 
65 70 75 80 

Asp Arq Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr He 
85 90 95 

Ser Arg Leu Glu Pro Glu Asp Phe Ala Val Tyr Tyr Cys Gin Gin Tyr 
100 105 HO 

Gly Ser Ser Pro 
115 

(2) INFORMATION FOR SEQ ID NO: 3 04: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
'(D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: j oin (226 . . 28 0 , 406.. 701) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 04: 

AAACACATTC TCTGCAGACA AATTTGAGCT ACCTTGATCT TACCTGGACA GGTGGGGACA 6 0 

CTGAGCTGGT GCTGAGTTAC TCAGATGCGC CAGCTCTGCA GCTGTGCCCA GCCTGCCCCA 12 0 

TCCCCTGCTC ATTTGCATGT TCCCAGAGCA CAACCTCCTG CCCTGAAGCC TTATTAATAG 18 0 

GCTGGTCAGA CTTTGTGCAG GAATCAGACC CAGTCAGGAC ACAGC ATG GAC ATG 2 34 

Met Asp Met 
1 
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AGG GTC CTC GCT CAG CTC CTG GGG CTC GTG CTG CTC TGT TTC CCA G 28 0 

Arg Val Leu Ala Gin Leu Leu Gly Leu Leu Leu Leu Cys Phe Pro 
5 10 15 

GTAAGGATGG AGAACACTAG CAGTTTACTC AGCCCAGGGT GCTCAGTACT GCTTTACTAT 34 0 

TCAGGGAAAT TCTCTTACAA CATGATTAAT TGTGTGGACA TTTGTTTTTA TGTTTCCAAT 40 0 

CTCAG GT GCC AGA TGT GAC ATC CAG ATG ACC CAG TCT CCA TCC TCA 446 
Gly Ala Arg Cys Asp lie Gin Met Thr Gin Ser Pro Ser Ser 
20 25 30 

CTG TCT GCA TCT GTA GGA GAC AGA GTC ACC ATC ACT TGT CGG GCG AGT 4 94 

Leu Ser Ala Ser Val Gly Asp Arg Val Thr lie Thr Cys Arg Ala Ser 
35 40 45 

CAG GGT ATT AGC AGC TGG TTA GCC TGG TAT CAG CAG AAA CCA GAG AAA 542 
Gin Gly He Ser Ser Trp Leu Ala Trp Tyr Gin Gin Lys Pro Glu Lys 
50 55 60 

GCC CCT AAG TCC CTG ATC TAT GCT GCA TCC AGT TTG CAA AGT GGG GTC 5 90 

Ala Pro Lys Ser Leu He Tyr Ala Ala Ser Ser Leu Gin Ser Gly Val 
65 70 75 80 

CCA TCA AGG TTC AGC GGC AGT GGA TCT GGG ACA GAT TTC ACT CTC ACC 63 8 

Pro Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr 
85 90 95 

ATC AGC AGC CTG CAG CCT GAA GAT TTT GCA ACT TAT TAC TGC CAA CAG 686 
He Ser Ser Leu Gin Pro Glu Asp Phe Ala Thr Tyr Tyr Cys Gin Gin 
100 105 HO 

TAT AAT AGT TAC CCA CCCACAGTGT TACACACCCA AACATAAACC CCCAGGGAAG 741 
Tyr Asn Ser Tyr Pro 
115 

CAGATGTGTG AGGCTGGGCT GCCCCAGCTG CTTCTCCTGA TGCCTCCATC AGCTGAGAGT 801 
GTTCCTCAGA TGCAGCCACA CTCTGATGGT GTTGGTAGAT GGGGAC 84 7 



(2) INFORMATION FOR SEQ ID NO: 3 05: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 05: 

Met Asp Met Arg Val Leu Ala Gin Leu Leu Gly Leu Leu Leu Leu Cys 
15 10 15 

Phe Pro Gly Ala Arg Cys Asp He Gin Met Thr Gin Ser Pro Ser Ser 
20 25 30 

Leu Ser Ala Ser Val Gly Asp Arg Val Thr He Thr Cys Arg Ala Ser 
35 40 45 

Gin Gly He Ser Ser Trp Leu Ala Trp Tyr Gin Gin Lys Pro Glu Lys 
50 55 60 

Ala Pro Lys Ser Leu He Tyr Ala Ala Ser Ser Leu Gin Ser Gly Val 
65 70 75 80 
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Pro Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr 
85 90 95 

He Ser Ser Leu Gin Pro Glu Asp Phe Ala Thr Tyr Tyr Cys Gin Gin 
100 105 HO 

Tyr Asn Ser Tyr Pro 
115 



(2) INFORMATION FOR SEQ ID NO: 306: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 134 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 06: 
GCCTCGGACA CCGCCATGTA TTACTGTGTG AGACATTTAT GGTTCGGGGA GTTACGCGGT 
GTGAACGTCT GGGGCCAAGG GACCACGGTC ACCGTCTCCT CAGC CAAAAC GACACCCCCA 
TCTGTCTATC CACT 



(2) INFORMATION FOR SEQ ID NO: 307: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307: 
GCCTCGGACA CCGCCATGTA TTACTGTGCG AGACACTGGG CATTGGATGC TCTTGATGTC 
TGGGGCCAAG GGACAATGCT CACCGTCTCT TCAGCCAAAA CGACACCCCC ATCTGTCTAT 
CCACT 



(2) INFORMATION FOR SEQ ID NO: 308: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 122 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:308: 
GCCTCGGACA CCGCCATGTA TTACTGTGCG AGAACTGGGG ATGATGCTTT TGATATCTGG 



60 
120 
134 



60 
120 
125 
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GGCCAAGGGA CAATGGTCAC CGTCTCTTCA GC C AAAAC AA CACCCCCATC AGTCTATCCA 12 0 

CT 122 

(2) INFORMATION FOR SEQ ID NO: 309: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 122 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309: 
GACTCGGACA CCGCCATGTA TTACTGTGCG AGACAGGGGA GAGATGCTTT AGATATCTGG 
GGCCAAGGGA CAATGGTCAC CGTCTCTTCA GC CAAAACAA CACCCCCATC AGTCTATCCA 
CT 

(2) INFORMATION FOR SEQ ID NO: 310: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310: 
GCCTCGGACA CCGCCATGTA TTATTGTGTG AGACATAGGG ACTATATTTC GGGGAGTTAT 
TTTCCTGACT ACTGGGGCCA GGGAACCCTG GTCACCGTCT CCTCAGCCAA AACAACACCC 
C CAT C AGT C T ATCCACT 

(2) INFORMATION FOR SEQ ID NO: 3 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 122 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311: 
GCCTCGGACA CCGCCATGTA TTACTGTGCG AGAACTGGGG ATGATGCTTT TGATATCTGG 
GGCCAAGGGA CAATGGTCAC CGTCTCTTCA GCCAAAACAA CACCCCCATC AGTCTATCCA 
CT 



60 
120 
122 



60 
120 
137 



60 
120 
122 
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(2) INFORMATION FOR SEQ ID NO:312: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 base pairs 

(B) TYPE: nucleic acid 

(C) S TRANDEDNE S S : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:312: 
GCCTCGGACA CCGCCATGTA TTACTGTGCG AGACATGGGT CTATGGATAT CTGGGGCCAA 
GGGACAATGG TCACCGTCTC TTCAGCTACA ACAACAGCCC CATCTGTCTA TCCCTT 



(2) INFORMATION FOR SEQ ID NO:313: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 8 base pairs 

(B) TYPE: nucleic acid 

(C) S TRANDEDNE SS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:313: 
GCCTCGGACA CCGCCATGTA TTACTGTGCG AGAGAGAGCG GTCACTGGGG ATCGTTTGAC 
TATTGGGGCC AGGGAACCCT GGTCACCGTC TCCTCAGCTA CAACAACAGC CCCATCTGTC 
TATCCCTT 



(2) INFORMATION FOR SEQ ID NO: 3 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314: 
GCCTCGGACA CCGCCATGTA TTACTGTGCG AGAAGGGACC CCCCTGATGC TTTTGATATC 
TGGGGCCAAG GGACAATGGT CACCGTCTCT TCAGCTACAA CAACAGCCCC ATCTGTCTAT 
CCCTT 



(2) INFORMATION FOR SEQ ID NO:315: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 131 base pairs 

(B) TYPE: nucleic acid 

(C) S TRANDEDNE S S : single 

( D ) TOPOLOGY : 1 inear 



(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315: 

GCCTCGGACA CCGCCATGTA TTACTGTGCG AGACGGGGGC CTTACTACTA CTACGGTATG 6 0 

GACGTCTGGG GCCAAGGGAC CACGGTCACC GTCTCCTCAG CTACAACAAC AGCCCCATCT 12 0 

GTCTATCCCT T 131 

(2) INFORMATION FOR SEQ ID NO: 316: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1674 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316: 

CGAGAGGGGC GGGGGGAAGA CTACTATCCC AGGCAGGTTT TAGGTTC CAG AGTCTGCGAG 6 0 

AAATCCCACC ATCTACCCAC TGACACTCCC ACCAGTCCTG TGCAGTGATC CCGTGATAAT 12 0 

CGGCTGCCTG ATTCACGATT ACTTCCCTTT CGGCACGATG AATGTGACCT GGGGAAAGAG 18 0 

TGGGAAGGAT ATAACCACCG TGAACTTTCC ACCTGCCCTC GCCTCTGGGG GACGGTACAC 240 

CATGAGCAGC CAGTTAACCC TGCCAGCTGT CGAGTGCCCA GAAGGAGAGT CCGTGAAATG 3 00 

TTCCGTGCAA CATGACTCTA ACCCCGTCCA AGAATTGGAT GTGAATTGCT CTGGTAAAGA 360 

ACGTTAGGGG GTCAGCTAGG GGTGGGATAA GTCCTACCTT AT CTAGATC C ATATATC CCT 42 0 

CTGATGCACA CCCTCACAGG AATCCCTCAG AAACCTCCAC TATGGGGATT GGGGGAAGGA 48 0 

AGCGTAAACA GGTCTAGAAG GAGCTGGAGG CCTCAGAACA TCCAGAAACG GGGACAGCAA 54 0 

AGGAGACAAG GAGAATATAC TGATTTGCTA GGACATCTTC TGTTACAGGT CCTACTCCTC 600 

CTCCTCCTAT TACTATTCCT TCCTGCCAGC CCAGCCTGTC ACTGCAGCGG CCAGCTCTTG 660 

AGGACCTGCT CCTGGGTTCA GATGCCAGCA TCACATGTAC TCTGAATGGC CTGAGAAATC 72 0 

CTGAGGGAGC TGCTTTCACC TGGGAGCCCT CCACTGGGAA GGATGCAGTG CAGAAGAAAG 78 0 

CTGCGCAGAA TTCCTGCGGC TGCTACAGTG TGTCCAGCGT CCTGCCTGGC TGTGCTGAGC 84 0 

GCTGGAACAG TGGCGCATCA TTCAAGTGCA CAGTTACCCA TCCTGAGTCT GGCACCTTAA 900 

CTGGCACAAT TGCCAAAGTC ACAGGTGAGC T CAGATGC AT ACCAGGACAT TGTATGACGT 960 

TCCCTGCTCA CATGCCTGCT TTCTTCCTAT AATACAGATG CTCAACTAAC TGCTCATGTC 102 0 

CTTATAT CAC AGAGGGAAAT TGGAGCTATC TGAGGAACTG C C C AGAAGGG AAGGGCAGAG 108 0 

GGGTCTTGCT CTCCTTGTCT GAGC CATAAC TCTTCTTTCT ACCTTCCAGT GAACACCTTC 114 0 

CCACCCCAGG TCCACCTGCT ACCGCCGCCG TCGGAGGAGC TGGCCCTGAA TGAGCTCTTG 12 00 

TCCCTGACAT GCCTGGTGCG AGCTTTCAAC CCTAAAGAAG TGCTGGTGCG ATGGCTGCAT 12 6 0 

GGAAATGAGG AGCTGTCCCC AGAAAGCTAC CTAGTGTTTG AGCCCCTAAA GGAGC CAGGC 13 2 0 

GAGGGAGCCA CCACCTACCT GGTGACAAGC GTGTTGCGTG TATCAGCTGA AACCTGGAAA 13 8 0 
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CAGGGTGACC AGTACTCCTG CATGGTGGGC CACGAGGCCT TGCCCATGAA CTTCACCCAG 
AAGAC CATCG ACCGTCTGTC GGGTAAACCC ACCAATGTCA GCGTGTCTGT GATCATGTCA 
GAGGGAGATG GCATCTGCTA CTGAGCCACC CTGCCTGTCC CTACTCCTAG AATAAACTCT 
GTGCTCATCC AAAGTATCCC TGCACTTCCA CCCAGTGCCT GTCCACCACC CTGGGGTCTA 
CGAAACACAG GGAGGGGT C A GGGCCCAGGG AGGGAGAAAT ACCACCACCT AAGC 

(2) INFORMATION FOR SEQ ID NO: 317: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 133 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317: 
GCCTCGGACA CCGCCATGTA TTACTGTGTG AGACATTTAT GGTTCGGGGA GTTACGCGGT 
GTGAACGTCT GGGGCCAAGG GACCACGGTC ACCGTCTCCT CAGC CAAAAC GACACCCCCA 
TCTGTCTATC CAC 

(2) INFORMATION FOR SEQ ID NO: 318: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:318: 
GCCTCGGACA CCGCCATGTA TTACTGTGCG AGACACTGGG CATTGGATGC TCTTGATGTC 
TGGGGCCAAG GGACAATGCT CACCGTCTCT TCAGCCAAAA CGACACCCCC ATCTGTCTAT 
CCAC 

(2) INFORMATION FOR SEQ ID NO: 319: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 121 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319: 
GCCTCGGACA CCGCCATGTA TTACTGTGCG AGAACTGGGG ATGATGCTTT TGATATCTGG 6 0 
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GGCCAAGGGA CAATGGTCAC CGTGTCTTCA GCCAAAACAA CACCCCCATC AGTCTATCCA 12 0 



C 



121 



(2) INFORMATION FOR SEQ ID NO: 320: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 121 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:320: 
GACTCGGACA CCGCCATGTA TTACTGTGCG AGACAGGGGA GAGATGCTTT AGATATCTGG 6 0 

GGCCAAGGGA CAATGGTCAC CGTCTCTTCA GCCAAAACAA CACCCCCATC AGTCTATCCA 12 0 

C 



121 



(2) INFORMATION FOR SEQ ID NO: 321: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 136 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321: 

GCCTCGGACA CCGCCATGTA TTATTGTGTG AGACATAGGG ACTATATTTC GGGGAGTTAT 60 

TTTCCTGACT ACTGGGGCCA GGGAACCCTG GTCACCGTCT CCTCAGCCAA AACAACACCC 12 0 

CCATCAGTCT ATCCAC 136 

(2) INFORMATION FOR SEQ ID NO: 32 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 121 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:322: 

GCCTCGGACA CCGCCATGTA TTACTGTGCG AGAACTGGGG ATGATGCTTT TGATATCTGG 6 0 

GGCCAAGGGA CAATGGTCAC CGTCTCTTCA GCCAAAACAA CACCCCCATC AGTCTATCCA 12 0 

C 121 
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(2) INFORMATION FOR SEQ ID NO:323: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 115 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:323: 
GCCTCGGACA CCGCCATGTA TTACTGTGCG AGACATGGGT CTATGGATAT CTGGGGCCAA 6 0 

GGGACAATGG TCACCGTCTC TTCAGCTACA ACAACAGCCC CATCTGTCTA TCCCT 115 

(2) INFORMATION FOR SEQ ID NO:324: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324: 

GCCTCGGACA CCGCCATGTA TTACTGTGCG AGAGAGAGCG GTCACTGGGG ATCGTTTGAC 6 0 

TATTGGGGCC AGGGAACCCT GGTCACCGTC TCCTCAGCTA CAACAACAGC CCCATCTGTC 12 0 

TATCCCT 127 

(2) INFORMATION FOR SEQ ID NO: 32 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325: 

GCCTCGGACA CCGCCATGTA TTACTGTGCG AGAAGGGACC CCCCTGATGC TTTTGATATC 6 0 

TGGGGCCAAG GGACAATGGT CACCGTCTCT TCAGCTACAA CAACAGCCCC ATCTGTCTAT 12 0 

CCCT 124 

(2) INFORMATION FOR SEQ ID NO: 326: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 326: 
GCCTCGGACA CCGCCATGTA TTACTGTGCG AGACGGGGGC CTTACTACTA CTACGGTATG 
GACGTCTGGG GCCAAGGGAC CACGGTCACC GTCTCCTCAG CTACAACAAC AGCCCCATCT 
GTCTATCCCT 

(2) INFORMATION FOR SEQ ID NO: 327: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 7: 
TACTGGTACT TCGATCTCTG GGGCCGTGGC AC 

(2) INFORMATION FOR SEQ ID NO: 328: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 293 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 328: 
TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTA CCAGCTACTG GATCGGCTGG 
GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 
GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTC CATC 
AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 
GCGAGACAAG GGACTGGGGA GGACTGGTAC TTCGATCTCT GGGGC CGTGG CAC 

(2) INFORMATION FOR SEQ ID NO: 32 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 290 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329: 
TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTA CCAGCTACTG GATCGGCTGG 
GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 
GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTC CATC 
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AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 24 0 

GCGAGACAGA AGGGGGGAAG GGGGTACTTC GATCTCTGGG GCCGTGGCAC 2 90 

(2) INFORMATION FOR SEQ ID NO: 33 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 87 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 0: 

TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTA CCAGCTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 18 0 

AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 24 0 

GCGAGAACTG GGTGGTACTG GTACTTCGAT CTCTGGGGCC GTGGCAC 28 7 

(2) INFORMATION FOR SEQ ID NO: 331: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 31: 
GCTTTTGATA TCTGGGGCCA AGGGAC 26 

(2) INFORMATION FOR SEQ ID NO: 332: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 93 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:332: 

TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTA CCAGCTACTG GATCGGCTGG 6 0 

GCGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 18 0 

AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 24 0 

GCGAGACATG AAACTGGGGA TCCGGGGGCT TTTGATATCT GGGGCCAAGG GAC 2 93 
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(2) INFORMATION FOR SEQ ID NO:333: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:333: 
TACTTTGACT ACTGGGGCCA GGGAAC 26 

(2) INFORMATION FOR SEQ ID NO: 334: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 05 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 34: 

TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTA CCAGCTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 18 0 

AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 24 0 

GCGAGAACGG ATTACGATAT TTTGACTGGT TATTATAACC CTTTTGACTA CTGGGGCCAG 300 
GGAAC 
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(2) INFORMATION FOR SEQ ID NO: 335: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 284 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335: 

TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTA CCAGCTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 18 0 

AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 24 0 

GCGAGTATAG CAGCAGCCCT TTTTGACTAC TGGGGCCAGG GAAC 284 
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(2) INFORMATION FOR SEQ ID NO: 33 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 290 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336: 



TCTCTGAAGA 


TCTCCTGTAA 


GGGTTCTGGA 


TACAGCTTTA 


CCAGCTACTG 


GATCGGCTGG 


60 


GTGCGCCAGA 


TGCCCGGGAA 


AGGCCTGGAG 


TGGATGGGGA 


TCATCTATCC 


TGGTGACTCT 


120 


GATACCAGAT 


ACAGCCCGTC 


CTTCCAAGGC 


CAGGTCACCA 


TCTCAGCCGA 


CAAGTCCATC 


180 


AGCACCGCCT 


ACCTGCAGTG 


GAGCAGCCTG 


AAGGCCTCGG 


ACACCGCCAT 


GTATTACTGT 


240 


GCGAGATCTC 


TACTATGGTT 


CGGGGAGTTT 


GAC TACTGGG 


GCCAGGGAAC 
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(2) INFORMATION FOR SEQ ID NO: 3 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 05 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337: 



TCTCTGAAGA 


TCTCCTGTAA 


GGGTTCTGGA 


TACAGCTTTA 


CCAGCTACTG 


GATCGGCTGG 


60 


GTGCGCCAGA 


TGCCCGGGAA 


AGGCCTGGAG 


TGGATGGGGA 


TCATCTATCC 


TGGTGACTCT 


120 


GATACCAGAT 


ACAGCCCGTC 


CTTCCAAGGC 


CAGGTCACCA 


TCTCAGCCGA 


CAAGTCCATC 


180 


AGCACCGCCT 


ACCTGCAGTG 


GAGCAGCCTG 


AAGGCCTCGG 


ACACCGCCAT 


GTATTACTGT 


240 


GCGAGACTCG 


ATTACGATAT 


TTTGACTGGT 


TATTATAACC 


CCTTTGACTA 


CTGGGGCCAG 


300 


GGAAC 
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(2) INFORMATION FOR SEQ ID NO: 338: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 05 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:338: 
TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTA CCAGCTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 
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GAT AC C AG AT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCGGCCGA CAAGTC CATC 



180 



AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 



240 



GCGAGACTCG ATTACGATAT TTTGACTGGT TATTATAACC CCTTTGACTA CTGGGGCCAG 



300 



GGAAC 
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(2) INFORMATION FOR SEQ ID NO: 33 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 93 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339: 

TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTA CCAGCTACTG GATCGGCTGG 60 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 18 0 

AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 24 0 

GCGAGCCCGT ATAGCAGCAG CTGGTACAGG TTTGACTACT GGGGCCAGGG AAC 2 93 



(2) INFORMATION FOR SEQ ID NO: 340: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 0: 
TACTACTACT ACTACGGTAT GGACGTCTGG GGCCAAGGGA C 41 



(2) INFORMATION FOR SEQ ID NO: 341: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 317 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341: 

TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTA CCAGCTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 180 
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AGCACCGCCT ACCTGCAGTG GAGCAGC CTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 
GCGAGAGAGG GGGGGGTTTA TTAC TATGGT TCGGGGAGTT ATTACTACTA CGGTATGGAC 
GTCTGGGGCC AAGGGAC 

(2) INFORMATION FOR SEQ ID NO:342: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 05 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342: 
TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTA CCAGCTACTG GATCGGCTGG 
GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 
GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 
AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 
GCGAGAGAGG GAACTGGGGA CCATTACTGC TACTACTACG GTATGGACGT CTGGGGCCAA 
GGGAC 

(2) INFORMATION FOR SEQ ID NO:343: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:343: 
TACTGGTACT TCGATCTCTG GGGCCGTGGC ACCCTGGTC 3 9 

(2) INFORMATION FOR SEQ ID NO: 344: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 303 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:344: 
TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTCG TCAGCTACTG GATCGGCTGG 
GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 
GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 
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AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 



240 



GCGAGACAGG GAC TGGGGAT CGGGGAATGG TACTTCGATC TCTGGGGCCG TGGCACCCTG 



300 



GTC 



303 



(2) INFORMATION FOR SEQ ID NO: 34 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:345: 
GCTTTTGATA TCTGGGGCCA AGGGACAATG GTC 3 3 



(2) INFORMATION FOR SEQ ID NO: 346: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 91 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346: 

TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACGGCTTTA CCAGCTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GAT AC C AG AT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 18 0 

AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 24 0 

GCGGTAAACT GGGATCGGGC TTTTGATATC TGGGGCCAAG GGACAATGGT C 2 91 



(2) INFORMATION FOR SEQ ID NO: 34 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid - 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 347: 
TACTTTGACT ACTGGGGCCA GGGAACCCTG GTC 



33 
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(2) INFORMATION FOR SEQ ID NO: 34 8: 

<i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 297 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348: 



TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTA CCGGCTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TACCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 18 0 

AGTACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTTCTGT 24 0 

GCGAGACATA AGGCGGGGAT CAACTACTTT GCCTACTGGG GCCAGGGAAC CCTGGTC 2 97 



(2) INFORMATION FOR SEQ ID NO: 34 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 297 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE : DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 9: 



TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAACTTCA CCGGCTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TACCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 18 0 

AGTACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTTCTGT 24 0 

GCGAGACATA AGGCGGGGAT CAACTACTTT GCCTACTGGG GCCAGGGAAC CCTGGTC 2 97 



(2) INFORMATION FOR SEQ ID NO: 350: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 297 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350: 

TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAACTTTA CCAGCTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCG TCTCAGCCGA CAAGTCCATC 18 0 
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AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 24 0 

GCGAGAGACT GGGGAGAAGG GTATTACTTT GACTACTGGG GCCAGGGAAC CCTGGTC 2 97 

(2) INFORMATION FOR SEQ ID NO: 351: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 282 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351: 

TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGTTTTA CCAGCTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGCA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCGCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 180 

AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 24 0 

GCGAGACAAC TGGTTGACTA CTGGGGCCAG GGAACCCTGG TC 2 82 

(2) INFORMATION FOR SEQ ID NO: 352: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 312 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 52: 

TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTA TCAGCTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 18 0 

AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 24 0 

GCGAGACTGT ATTACTATGG TTCGGGGAGT TACCACAACT GGTTCGACCC CTGGGGCCAG 30 0 

GGAACCCTGG TC 312 

(2) INFORMATION FOR SEQ ID NO:353: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNES S : s ingl e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:353: 
AACTGGTTCG ACCCCTGGGG CCAGGGAACC CTGGTC 36 

(2) INFORMATION FOR SEQ ID NO: 3 54: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 318 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:354: 

TCTCTGAAGA TCTCCTGTAA GGTTTCTGGA TACAACTTTG CCAGCTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCACATATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 18 0 

AACACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 24 0 

GCGAGACAGG GGTATTACTA TGGTTCGGCA AATTATTATA ACATTTGGTT CGACCCCTGG 300 

GGC CAGGGAA CCCTGGTC 318 

(2) INFORMATION FOR SEQ ID NO:355: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355: 
TACTACTACT ACTACGGTAT GGACGTCTGG GGC CAAGGGA CCACGGTC 48 

(2) INFORMATION FOR SEQ ID NO: 356: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 97 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 56: 

TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TATAGTTTTG CCAACTACGG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCC 12 0 

GGTTCCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 180 

AACACCGCCT ACCTGCAATG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 24 0 



395 



GCGCCCCCGG CGTACTACTA CTACGGTATG GACGTCTGGG GCCAAGGGAC CACGGTC 297 



(2) INFORMATION FOR SEQ ID NO: 357: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 156 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 57: 

TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTA CCAGCTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTC 156 



(2) INFORMATION FOR SEQ ID NO: 358: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 156 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:358: 

TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTA CCAGTTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTC 156 



(2) INFORMATION FOR SEQ ID NO: 359: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 156 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:359: 

TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTA GCAGCTACTG GATCGGCTGG 60 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTC 156 
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(2) INFORMATION FOR SEQ ID NO: 36 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 156 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:360: 

TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTA CCAACTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTC 156 



(2) INFORMATION FOR SEQ ID NO: 361: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 156 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361: 

TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTA CCAGCTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCACCTATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTC 156 



(2) INFORMATION FOR SEQ ID NO: 362: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 362: 

TCTCTGAAGA TCTCCTGTAA GGGCTCTGGA TACAGCTTTA CCAGCTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTCCGAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 18 0 

AGCACCGCCT ACCTCGAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 240 

GCGAGACAGG GGGGGGATAG GTACTTCGAT CTCTGGGGCC GTGGCACCCT GGTC 2 94 
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(2) INFORMATION FOR SEQ ID NO: 363: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 97 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:363: 



TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTA CCAGCTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 18 0 

AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 24 0 

GCGAGACATT GGCTAAATGG GGATGCTTTT GATATC TGGG GCCAAGGGAC AATGGTC 29 7 



(2) INFORMATION FOR SEQ ID NO:364: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 312 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 64: 



TCTCTGAAGA 


TCTCCTGTAA 


GGGTTCTGGA 


TACAGCTTTA 


CCAGCTACTG 


GATCGGCTGG 


60 


GTGCGCCAGA 


TGCCCGGGAA 


AGGCCTGGAG 


TGGATGGGGA 


TCATCTATCC 


TGGTGACTCT 


120 


GATACCAGAT 


ACAGCCCGTC 


CTTCCAAGGC 


CAGGTCACCA 


TCTCAGCCGA 


CAAGTCCATC 


180 


AGCACCGCCT 


ACCTGCAGTG 


GAGCAGCCTG 


AGGGCCTCGG 


ACAGTGTCAT 


GTATTACTGT 


240 


GCGAGACGGG 


ATTACGATAT 


TTTGACTGGT 


TATTATGCGG 


CTTTTGATAT 


CTGGGGCCAA 


300 


GGGACAATGG 


TC 










312 



(2) INFORMATION FOR SEQ ID NO: 36 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 317 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 365: 
TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGTTTTT CCGACTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 
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GACACCAGAT AGAGCCCGTC CTTCCAGGGC CAGGTCTCCA TCTCAGTCGA CAAGTCCATC 



180 



AACACCGCCT TCCTGCAGTG GAACACGCTG GAGGCTTCGG ACACCGCCAT GTATTACTGT 



240 



GCGAGAGGGT ATTATTATGA TTCGGGGACT TATTATAAGT CTACCCCTTT GACTATTGGG 



300 



GC C AGGGAAC CCTGGTC 



317 



(2) INFORMATION FOR SEQ ID NO:366: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 288 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 366: 

TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGTTTTT CCGACTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GACACCAGAT ACAGCCCGTC CTTCCAGGGC CAGGTCTCCA TCTCAGTCGA CAAGTCCATC 18 0 

AACACCGCCT TCCTGCAGTG GAACACCCTG GAGGCTTCGG ACACCGCCAT GTATTACTGT 24 0 

GCGAGACTAA CTGGCCTCTT TAACTATTGG GGCCAGGGAA CCCTGGTC 28 8 



(2) INFORMATION FOR SEQ ID NO: 3 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 285 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367: 



TCTCTGAAGA 


TCTCCTGTAA 


GGGTTCTGGA 


TACAGCTTTA 


CCAGCTACTG 


GATCGGCTGG 


60 


GTGCGCCAGA 


TGCCCGGGAA 


AGGCCTGGAG 


TGGATGGGGA 


TCATCTATCC 


TGGTGACTCT 


120 


GATACCAGAT 


ACAGCCCGTC 


CTTCCAAGGC 


CAGGTCACCA 


TCTCAGCCGA 


CAAGTCCATC 


^ 180 


AGCACCGCCT 


ACCTGCAGTG 


GAGCAGCCTG 


AAGGCCTCGG 


ACACCGCCAT 


GTATTACTGT 


240 


GCGAGACATC 


TTTACTTTGA 


CTACTGGGGC 


CAGGGAACCC 


AGGTC 




285 



(2) INFORMATION FOR SEQ ID NO:368: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 82 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: DNA (genomic) 



399 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:368: 



TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTA CCAGCCAATG GATCGGCTGG 



60 



GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTGGCC TGGTGACTCT 



120 



GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 



180 



AGTACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 



240 



GCGAGACAAG GGTTTGACTA CTGGGGCCAG GGAACCCTGG TC 



282 



(2) INFORMATION FOR SEQ ID NO:369: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 69: 

TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGTTTTA GCAGCTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 18 0 

AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 24 0 

GCGAGACATC TTTACTTTGA CTACTGGGGC CAGGGAACCC AGGTC 28 5 

(2) INFORMATION FOR SEQ ID NO:370: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 282 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:370: 

TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTA CCAACTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 18 0 

AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 24 0 

GCGAGACAAA CTTTTGACTA CTGGGGCCAG GGAACCCTGG TC 2 82 
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(2) INFORMATION FOR SEQ ID NO: 371: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 03 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 71: 

TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTA CCAGCTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 18 0 

AGGACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 24 0 

GCGAGACATG GTATAGCAGC AGCTGGTACG TGGTTCGACC CCTGGGGCCA GGGAACCCTG 3 00 

GTC 303 

(2) INFORMATION FOR SEQ ID NO: 3 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 285 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:372: 

TCTCTGAAGA TCTCCTGTAG GGGTTCTGGA TACAGCTTTT CCAGTTACTG GATCGCCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GAAACCAGAT ACAGTCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 18 0 

AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 24 0 

GCGAGACAGG GCTACTTTGA CTACTGGGGC CAGGGAACCC TGGTC 285 

(2) INFORMATION FOR SEQ ID NO:373: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 285 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 73: 
TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTA CCAGCTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 
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GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 180 
AGTACCGCCT ACCTGCAGTG GAGCAGCCTG AGGGCCTCGG ACACCGCCAT TTATTACTGT 24 0 

GCGAGACATC TTTACTTTGA CTACTGGGGC CAGGGAACCC AGGTC 285 



(2) INFORMATION FOR SEQ ID NO: 3 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 297 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374: 



TCTCTGAAGA 


TCTCCTGTAA 


GGGTTCTGGA 


TACAGCTTTA 


CCAGCTACTG 


GATCGGCTGG 


60 


GTGCGCCAGA 


TGCCCGGGAA 


AGGCCTGGAG 


TGGATGGGGA 


TCATCTATCC 


TGGTGACTCT 


120 


GATACCAGAT 


ACAGCCCGTC 


CTTCCAAGGC 


CAGGTCACCA 


TCTCAGCCGA 


CAAGTCCATC 


180 


AGCACCGCCT 


ACCTGCAGTG 


GAGCAGCCTG 


AAGGCCTCGG 


ACACCGCCAT 


GTATTACTGT 


240 


GCGGCCGGGT 


AT AC C AGC AG 


CTGGTTCTTT 


GACTTCTGGG 


GCCAGGGAAC 


CCTGGTC 


297 



(2) INFORMATION FOR SEQ ID NO: 3 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 285 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375: 

TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTA CCAGCTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 18 0 

AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATCGCTGT 24 0 

GCGAGACATC TTTACTTTGA CTACTGGGGC CAGGGAACCC AGGTC 28 5 



(2) INFORMATION FOR SEQ ID NO: 376: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 00 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 



(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 376: 



TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTC CCATCTACTG GATCGGCTGG 



60 



GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 



120 



GAT AC C AG AT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 



180 



AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 



240 



GCGAGAGTGG TTCGGGGATT TATTATTTAC TTTGACTACT GGGGCCAGGG AACCCTGGTC 



300 



(2) INFORMATION FOR SEQ ID NO: 377 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 87 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377: 

TCTCTGAAGA TCTCCTGTAA GGTTTCTGGA TACAGCTTAA CCAGTTATTG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 18 0 

AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 24 0 

GCGAGACAAA GGGGTACTTT GACTACTGGG GCCAGGGAAC CCTGGTC 28 7 



(2) INFORMATION FOR SEQ ID NO: 3 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 288 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:378: 

TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTA CCAGCTACTG GATCGGCTGG 60 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 18 0 

AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 24 0 

GCGAGGGGAT CGTGGTACTT TGACTACTGG GGCCAGGGAA CCCTGGTC 28 8 



(2) INFORMATION FOR SEQ ID NO: 379: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 79: 



TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAACTTTA CCACCTACTG GATCGGCTGG 



60 



GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 



120 



GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCGTC 



180 



AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 



240 



GCGAGACTCC CCAATGACAG TTGGTTCGAC CCCTGGGGCC AGGGAACCCT GGTC 



294 



(2) INFORMATION FOR SEQ ID NO: 380: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 312 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:380: 

TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTA CCAGCTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 18 0 

AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 24 0 

GCGAGACGGG GGTACTATGG TTCGGGGAGT TATTATAACT GGTTCGACCC CTGGGGCCAG 3 00 

GGAACCCTGG TC 312 

(2) INFORMATION FOR SEQ ID NO: 3 81: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 312 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 381: 

TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAACTTTA TCACCTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TACCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATCC TGGTGACTCT 12 0 

GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 18 0 

AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 24 0 

GCGAGACATG AGCAGCTGGT ACAGGGTTAC TACTACTACG GTATGGACGT CTGGGGCCAA 3 00 

GGGACCACGG TC 312 
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(2) INFORMATION FOR SEQ ID NO: 382: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 9 base pairs 

(B) TYPE : nucleic acid 

(C) STRAHDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:382: 

TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACACCTTTA CCAGTTACTG GATCGCCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTATC C TGGTGACTCT 12 0 

GAT AC C AG AT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA CAAGTCCATC 18 0 

AGCACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 24 0 

GCGAGAGATA TGGGGGGGGC CTCCTACTTC TACTTCGGTA TGGACGTCTG GGGCCAAGGG 3 00 

ACCACGGTC 3 09 



(2) INFORMATION FOR SEQ ID NO: 383: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRA3STDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 383: 



TCTCTGAAGA 


TCTCCTGTAA 


GGGTTCTGGA 


TACAGCTTTG 


CCAACTACTG 


GATCGGCTGG 


60 


GTGCGCCAGA 


TGCCCGGGAA 


AGGCCTGGAG 


TGGATGGGGA 


TCATCTTTCC 


TGGTGACTCT 


120 


GATACCAGAT 


ACAGCCCGTC 


CTTCCAAGGC 


CAGGTCACCA 


TCTCAGCCGA 


CAACTCCATC 


180 


AGCACCGCCT 


ACCTGCAGTG 


GAGCAGCCTG 


AAGGCCTCGG 


ACACCGCCAT 


GTATTACTGT 


240 


GCGAGACACC 


ACGACTACTA 


CGGTATGGAC 


GTCTGGGGCC 


AAGGGACCAC 


GGTC 
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(2) INFORMATION FOR SEQ ID NO: 3 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 288 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:384: 
TCTCTGAAGA TCTCCTGTAA GGGTTCTGGA TACAGCTTTA CCAGCTACTG GATCGGCTGG 6 0 

GTGCGCCAGA TGCCCGGGAA AGGCCTGGAG TGGATGGGGA TCATCTTTCC TGGTGACTCT 12 0 
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GATACCAGAT ACAGCCCGCC CTTCCAAGGC CAGGTCACCA TCTCAGCCGA GAAGTCCATC 180 

AACACCGCCT ACCTGCAGTG GAGCAGCCTG AAGGCCTCGG ACACCGCCAT GTATTACTGT 24 0 

GCGAGACGCT ACTACGGTAT GGACGTCTGG GGCCAAGGGA CCACGGTC 288 

(2) INFORMATION FOR SEQ ID NO: 385: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 246 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385: 

TCCCTGAGAC TCTCCTGTGC AGCCTCTGGA TTCACCTTCA GTAGCTATGC TATGCACTGG 60 

GTCCGCCAGG CTCCAGGCAA GGGGCTGGAG TGGGTGGCAG TTATATCATA TGATGGAAGC 12 0 

AATAAATACT ACGCAGACTC CGTGAAGGGC CGATTCACCA TCTCCAGAGA CAATTCCAAG 18 0 

AACACGCTGT ATCTGCAAAT GAACAGCCTG AGAGCTGAGG ACACGGCTGT GTATTACTGT 24 0 

GCGAGA 246 

(2) INFORMATION FOR SEQ ID NO:386: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
{ D } TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 86: 

TCCCTGAGAC TCTCCTGTGC AGCCTCTGGA TTCACCTTCA GTAGCTATGC TATGCACTGG 6 0 

GTCCGCCAGG CTCCAGGCAA GGGGCTGGAC TGGGTGGCAG TTATATCATA TGATGGAACC 12 0 

AATAAATACG ACGCAGACTC CGTGAAGGGC CGATTCACCA TCTCCAGAGA CAATTCCAAG 18 0 

AACACGCTGT ATCTGCAAAT GAACAGCCTG AGATCTGAGG ACACGGCTGT GTATTACTGT 240 

GCGAGAGAGT CTTCCGGCTG GTACTTCGAT TTCTGGGGCC GTGGCACCCT GGTC 2 94 

(2) INFORMATION FOR SEQ ID NO: 3 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38 7: 



TCCCTGAGAC TCTCCTGTGC AGCCTCTGGA TTCACCTTCA 


GTAGCTATGC 


TATGCACTGG 


60 


GTCCGCCAGG CTCCAGGCAA GGGGCTGGAC TGGGTGGCAG 


TTATATCATA 


TGATGGAACC 


120 


AATAAATACG ACGCAGACTC CGTGAAGGGC CGATTCACCA 


TCTC CAGAGA 


CAATTC CAAG 


180 


AACACGCTGT ATCTGCAAAT GAACAGCCTG AGATCTGAGG 


ACACGGCTGT 


GTATTACTGT 


240 


GCGAGGAAGT CTTCCGGCTG GTACTTCGAT TTCTGGGGCC 


GTGGCACCCT 


GGTC 


294 


(2) INFORMATION FOR SEQ ID NO: 388: 








(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 








(ii) MOLECULE TYPE: DNA (genomic) 








(xi) SEQUENCE DESCRIPTION: SEQ ID NO:388: 






TCCCTGAGAC TCTCCTGTGC AGCCTCTGGA TTCACCTTCA 


ATAACTGTAC 


TATACACTGG 


60 


GTCCGCCAGG CTCCAGGCAA GGGGCTGGAC TGGGTGGCAG 


TTATATCATA 


TGATGGAGCC 


120 


AATAAATACG ACGCAGAGTC CGTGAAGGGC CGATTCACCA 


TCTC CAGAGA 


CAATTCCAAG 


180 


AACATGCTGT ATCCGCAAAT GAACAGCCTG AGATCTGAGG 


ACACGGCTGT 


GTATTACTGT 


240 


GCGAGAGAGT CCTCCGGCTG GTACTTCGAT CTTTGGGGCC 


GTGGCACCCT 


GGTC 


294 


(2) INFORMATION FOR SEQ ID NO:389: 








(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 294 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 








(ii) MOLECULE TYPE: DNA (genomic) 








(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389: 






TCCCTGAGAC TCTCCTGTGC AGCCTCTGGA TTCACCTTCA 


ATAACTGTAC 


TATACACTGG 


60 


GTCCGCCAGG CCCCAGGCAA GGGGCTGGAC TGGGTGGCAG 


TTATATCATA 


TGATGGAGCC 


120 


AATAAATACG ACGCAGACTC CGTGAAGGGC CGATTCACCA 


TCTCCAGAGA 


CAATTCCAAG 


180 


AACACGCTGT ATCTGCAAAT GAACAGCCTG GGATC TGAGG 


ACACGGCTGT 


GTATTACTGT 


240 


GCGAGCGAGT CCTCCGGCTC TTACTTCGAT CTCTGGGGCC 


GTGGCACCCT 


GGTC 


294 
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(2) INFORMATION FOR SEQ ID NO: 3 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
{ D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390: 

TCCCTGAGAC TCTCCTGTGC AGCCTCTGGA TTCACCTTCA GTAGCTATGC TATGCACTGG 60 

GTCCGCCAGG CTCCAGGCAA GGGGCTGGAC TGGGTGGCAA TTATGTCATA TGATGGAACC 120 

AATAAATTCG ACGCAGACTC CGTGAAGGGC CGATTCACCA TCTCCAGAGA CAATTCCAAG 18 0 

AACACGCTGT ATCTGCAAAT GAACAGC CTG AGATCTGAGG ACACGGCTGT GTATTACTGT 24 0 

GCGAGAGAGT CTTCCGGCTG GTACTTCGAT CTCTGGGGCC GTGGCACCCT GGTC 2 94 

(2) INFORMATION FOR SEQ ID NO: 391: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 91: 

TCCCTGAGAC TCTCCTGTGC AGCCTCTGGA TTCACCTTCA ATAACTGTAC TCTACACTGG 6 0 

GTCCGCCAGG CTCCAGGCAA GGGGCTGGAC TGGGTGTCAG TTATATCATA TGATGGAGCC 12 0 

AATAAATACG ACGCAGACTC CGTGAAGGGC CGATTCACCA TCTCCAGAGA CAATTCCAAG 18 0 

AACACGCTGT ATCTGCAAAT GAACAGC CTG AGATCTGAGG ACACGGCTGT GTATTACTGT 24 0 

GCGAGAGAGT CCTCCGGCTG GTACTTCGAT CTCTGGGGCC GTGGCACCCT GGTC 2 94 

(2) INFORMATION FOR SEQ ID NO: 3 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:392: 

TCCCTGAGAC TCTCCTGTGC AGCCTCTGGA TTCACCTTCA GTAGCTATGC TATGCACTGG 6 0 

GTCCGCCAGG CTCCAGGCAA GGGGCTGGAC TGGGTGGCAG TTTTTTCATA TGATGGAACC 12 0 

AATAAATACG ACGCAGACTC CGTGAAGGGC CGATTCACCA TCTCCAGAGA CAATTCCAAG 18 0 
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AACACGCTGT ATCTGCAAAT GAACAGCGTG AGATCTGAGG ACACGGCTGT TTATTACTGT 24 0 

GCGAGAGAGT CTTCCGGCTG GTACTTCGAT TTCTGGGGCC GTGGCACCCT GGTC 2 94 

(2) INFORMATION FOR SEQ ID NO: 3 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 93: 

TCCCTGAGAC TCTCCTGTGC AGCCTCTGGA TTCACCTTCA GTAACTATAC TATGCACTGG 6 0 

GTCCGCCAGG CTCCAGGCAA GGGGCTGGAC TGGGTGGCAG TTTTTTCATA TGATGGAACC 12 0 

AATAAATACG ACGCAGACTC CGTGAAGGGC CGATTCACCA TCT CCAGAGA CAATTCCAAG 180 

AACACGCTGT ATCTGCAAAT GAACAGC CTG AGATCTGAGG ACACGGCTGT TTATTACTGT 24 0 

GCGAGAGAGT CTTCCGGCTG GTACTTCGAT TTCTGGGGCC GTGGCACCCT GGTC 2 94 

(2) INFORMATION FOR SEQ ID NO:394: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 394: 
TCCCTGAGAC TCTCCTGTGC AGCCTCTGGA TTCACCTTCA ATAACTGTAC TATACACTGG 6 0 

GTCCGCCAGG CTCCAGGCAA GGGGCTGGAC TGGGTGGCAG TT AT AT CAT A TGATGGAGCC 12 0 

AATAAATACG ACGCAGACTC CGTGAAGGGC CGATTCACCA TCTCCAGAGA CAAATCCAAG 18 0 

AACACGCTGT ATCTGCAAAT GAGCAGCCTG AGATCTGAAG ACACGGCTGT ATATTACTGT 24 0 

GTGAGAGAGT CCTCCGGCTG GTACTTCGAT CTCTGGGGCC GTGGCACCCT GGTC 2 94 

(2) INFORMATION FOR SEQ ID NO:395: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:395: 
TCCCTGAGAC TCTCCTGTGC AGCCTCTGGA TTCACCTTCA GTAGCTATGC TATGCACTGG 



60 
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GTCCGCCAGG CTCCAGGCAA GGGGCTGGAG TGGGTGGCAG TTATATCATA TGATGGAAGC 



120 



AATAAATACT ACGCAGACTC CGTGAAGGGC CGATTCACCA TCTCCAGAGA CAATTCCAAG 



180 



AACACGCTGT ATCTGCAAAT GAACAGCCTG AGAGCTGAGG ACACGGCTGT GTATTACTGT 



240 



GCGAGAGGAT GGTTCGGGGA GTTATGGGAC TACTGGGGCC AGGGAACCCT GGTC 



294 



(2) INFORMATION FOR SEQ ID NO: 396: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 97 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 396: 



TCCCTGAGAC 


TCTCCTGTGC 


AGCCTCTGGA 


TTCACCTTCA 


GCAGCTATGC 


TATGCACTGG 


60 


GTCCGCCAGG 


CTCCAGGCAA 


GGGGCTGGAG 


TGGGTGGCAG 


TTATATCATA 


TGATGGAAGC 


120 


AATAAATACT 


ACGCAGACTC 


CGTGAAGGGC 


CGATTCACCA 


TCTCCAGAGA 


CAATTCCAAG 


180 


AACACGCTGT 


ATCTGCAAAT 


GAACAGCCTG 


GGAGCTGAGG 


ACACGGCTGT 


GTATTACTGT 


240 


GCGAGAGAGA 


GTCTCTTAAC 


TGGGGACTTT 


GACTACTGGG 


GCCAGGGAAC 


CCTGGTC 


297 



(2) INFORMATION FOR SEQ ID NO:397: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 97: 



TCCCTGAGAC 


TCTCCTGTGC 


AGCCTCTGGA 


TTCACCTTCA 


GTAGCTATGC 


TATGCACTGG 


60 


GTCCGCCAGG 


CTCCAGGCAA 


GGGGCTGGAG 


TGGGTGGCAG 


TTATATCATA 


TGATGGAAGC 


120 


AATAAATACT 


ACGCAGACTC 


CGTGAAGGGC 


CGATTCACCA 


TCTCCAGAGA 


CAATTCCAAG 


180 


AACACGCTGT 


ATCTGCAAAT 


GAACAGCCTG 


AGAGCTGAGG 


ACACGGCTGT 


GTATTACTGT 


240 


GCGAGAGGAT 


GGTTCGGGGA 


GTTATGGGAC 


TACTGGGGCC 


AGGGAACCCT 


GGTC 


294 



(2) INFORMATION FOR SEQ ID NO:398: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 312 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 398: 

TCCCTGAGAC TCTCCTGTGC AGCCTCTGGA TTCACCTTCA GTAGCTATGC TATGCACTGG 6 0 

GTCCGCCAGG CTCCAGGCAA GGGGCTGGAG TGGGTGGCAG TTATATCATA TGATGGAAGC 12 0 

AATAAATACA ACGCAGACTC CGTGAAGGGT CGATTCACCA TCTC CAGAGA CAACTCCAAG 180 

AACACGCTGT ATCTGCAAAT GAACAGCCTG AGAGCTGAGG ACACGGCTGT GTATTACTGT 24 0 

GCGAGAGGGT ACGATATTTT GACTGGTTAT TATGACCCGC TCTTTGACAA CTGGGGCCAG 3 00 

GGAACCCTGG TC 312 

(2) INFORMATION FOR SEQ ID NO: 3 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 312 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 399: 

TCCCTGAGAC TCTCCTGTGC AGCCTCTGGA TTCACCTTCA GTAGCTATAC TATGCACTGG 6 0 

GTCCGCCAGG CTCCAGGCAA GGGGCTGGAG TGGGTGGCAG TTATATCATA TGATGGAAGC 12 0 

AATAAATACT ACGCAGACTC CGTGAAGGGC CGATTCACCA TCTCCAGAGA CAATTCCAAG 18 0 

AACACGCTGT ATCTGCAAAT GAACAGCCTG AGAGCTGAGG ACACGGCTGT GTATTACTGT 24 0 

GCGAGAGGGT ACGATATTTT GACTGGTTAT TATGACCCGC TCTTTGACAA CTGGGGCCAG 3 00 

GGAACCCTGG TC 312 

(2) INFORMATION FOR SEQ ID NO: 40 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 312 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:400: 

TCCCTGAGAC TCTCCTGTGC AGCCTCTGGA TTCACCTTCA GTAGCTATGC TATGCACTGG 6 0 

GTCCGCCAGG CTCCAGGCAA GGGGCTGGAG TGGGTGGCAG TTATATCATA TGATGGAAGC 12 0 

AATAAATACT ACGCAGACTC CGTGAAGGGC CGATTCACCA TCTCCAGAGA CAGTTCCAAG 18 0 

AACACGCTGT ATCTGCAAAT GAACAGCCTG AGAGCTGAGG ACACGGCTGT GTATTACTGT 24 0 

GCGAGAGATC AGGCGGCGTA TAGTGGCTAC GGGTCGGGGG GTATGGACGT CTGGGGCCAA 3 00 

GGGACCACGG TC 312 
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(2) INFORMATION FOR SEQ ID NO:401: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 312 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 01: 



TCCCTGAGAC 


TCTCCTGTGC 


AGCCTCTGGA 


TTCACCTTCA 


GTAGCTATAC 


TATGCACTGG 


60 


GTCCGCCAGG 


CTCCAGGCAA 


GGGGCTGGAG 


TGGGTGGCAG 


TTATATCATA 


TGATGGAAGC 


120 


AATAAATACT 


ACGCAGACTC 


CGTGAAGGGC 


CGATTCACCA 


TCTC CAGAGA 


CAATTC CAAG 


180 


AACACGCTGT 


ATCTGCAAAT 


GAACAGCCTG 


AGAGCTGAGG 


ACACGGCTGT 


GTATTACTGT 


240 


GCGAGCCATT 


ACTATGGTTC 


GGGGAGTTAT 


AGCTACTACG 


GTATGGACGT 


CTGGGGCCAA 


300 


GGGACCACGG 


TC 
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(2) INFORMATION FOR SEQ ID NO:402: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 312 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:402: 



TCCCTGAGAC 


TCTCCTGTGC 


AGCCTCTGGA 


TTCACCTTCA 


GTAGCTATAC 


TATGCACTGG 


60 


GTCCGCCAGG 


CTCCAGGCAA 


GGGGCTGGGG 


TGGGTGGCAG 


TTATATCATA 


TGATGGAAGC 


120 


AATAAATACT 


ACGCAGACTC 


CGTGAAGGGC 


CGATTCACCA 


TCTC CAGAGA 


CAATTCCAAG 


180 


AACACGCTGT 


ATCTGCAAAT 


GAACAGCCTG 


AGAGCTGAGG 


ACACGGCTGT 


GTATTACTGT 


240 


GCGAGCCATT 


ACTATGGTTC 


GGGGAGTTAT 


AGCTACTACG 


GTATGGACGT 


CTGGGGCCAA 


300 


GGGACCACGG 


TC 










312 



(2) INFORMATION FOR SEQ ID NO: 4 03: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 06 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:403: 
TCCCTGAGAC TCTCCTGTGC AGCCTCTGGA TTCACCTTCA GTAGCTATGC TATGCACTGG 



60 
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GTCCGCCAGG CTCCAGGCAA GGGGCTGGAG TGGGTGGCAG TTATATTATA TGACGAAAGC 12 0 

AATAAATATT ACGCAGACTC CGTGAAGGGC CGAATCACCA TCTCCAGAGA CAATTC CAAG 180 

AACACGCTGT ATCTGCAAAT GAACAGCCTG AGAGC TGAGG ACACGGCTGT GTATTACTGT 24 0 

GCGAGAGAGG GGACTACGTA CTACTACTAC TACGGTATGG ACGTCTGGGG CCAAGGGACC 300 

ACGGTC 306 

(2) INFORMATION FOR SEQ ID NO: 4 04: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 246 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:404: 

TCGGTGAAGG TCTCCTGCAA GGCTTCTGGA GGCACCTTCA GCAGCTATGC TATCAGCTGG 6 0 

GTGCGACAGG CCCCTGGACA AGGGCTTGAG TGGATGGGAA GGATCATCCC TATCCTTGGT 12 0 

ATAGCAAACT ACGCACAGAA GTTCCAGGGC AGAGT CACGA TTACCGCGGA CAAATCCACG 18 0 

AGCACAGCCT ACATGGAGCT GAGCAGCCTG AGATCTGAGG ACACGGCCGT GTATTACTGT 24 0 

GCGAGA 246 

(2) INFORMATION FOR SEQ ID NO: 4 05: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 00 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 405: 

TCGGTGAAGG TCTCCTGCAA GGCTTCTGGA GGCACCTTCA GCACCTATGC TATCACCTGG 60 

GTGCGACAGG CCCCTGGACA AGGGCTTGAG TGGATGGGAA AGATCATCCC TATCTTTGGT 12 0 

ATAGCAAACT ACGCACAGAA GTTCCAGGGC AGAGTCACGA TTACCGCGGA CAAATCCACG 18 0 

AGCACAGCCT ACATGGAGCT GACCAGCCTG AGATCTGAGG ACACGGCCGT GTATTACTGT 24 0 

GCGAGAGACG AGACTGGGGA TCTCGGTGCT TTTGATATCT GGGGCCAAGG GACAATGGTC 3 00 

(2) INFORMATION FOR SEQ ID NO: 406: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 243 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 
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(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:406: 

ACCCTGTCCC TCACCTGCGC TGTCTATGGT GGGTCCTTCA GTGGTTACTA CTGGAGCTGG 60 

ATCCGCCAGC CCCCAGGGAA GGGGCTGGAG TGGATTGGGG AAATCAATCA TAGTGGAAGC 12 0 

ACCAACTACA ACCCGTCCCT CAAGAGTCGA GTCAC CAT AT CAGTAGACAC GTCCAAGAAC 180 

CAGTTCTCCC TGAAGCTGAG CTCTGTGACC GCCGCGGACA CGGCTGTGTA TTACTGTGCG 24 0 
AGA 



243 



(2) INFORMATION FOR SEQ ID NO: 407: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 82 base pairs 

(B) TYPE: nucleic acid 

(C) S TRANDEDNE S S : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:407: 

ACCCTGTCCC TCACCTGCGC TGTCTATGGT GGGTCCTTCA GTGGTTACTA CTGGAGCTGG 60 

ATCCGCCAGC CCCCAGGGAA GGGGCTGGAG TGGATTGGGG AAATCAATCA TAGTGGAAGC 12 0 

ACCAACTACA ACCCGTCCCT CAAGAGTCGA GTCACCATAT CAGTAGACAC GTCCAAGAAC 18 0 

CAGTTCTCCC TGAAGCTGAG CTCTGTGACC GCCGCGGACA CGGCTGTGTA TTACTGTGCG 24 0 

AGGAACTTAT TTTTTGACTA CTGGGGCCAG GGAACCCTGG TC 2 82 

(2) INFORMATION FOR SEQ ID NO: 4 08: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 285 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:408: 

ACCCTGTCCC TCACCTGCGC TGTCTATGGT GGGTCCTTCA GTGGTTACTA CTGGAGCTGG 6 0 

ATCCGCCAGC CCCCAGGGAA GGGGCTGGAG TGGATTGGGG AAATCAATCA TAGTGGAAGC 12 0 

ACCAACTACA ACCCGTCCCT CAAGAGTCGA GTCACCATAT CAGTAGACAC GTCCAAGAAC 18 0 

CAGTTCTCCC TGAAGCTGAG CTCTGTGACC GCCGCGGACA CGGCTGTGTA TTACTGTGCG 24 0 

AGGGCAGCTA ACTGGTTTGA CTACTGGGGC CAGGGAACCC TGGTC 28 5 
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(2) INFORMATION FOR SEQ ID NO: 409: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 297 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 409: 
ACCCTGTCCC TCACCTGCGC TGTCTATGGT GGGTCCTTCA GTGGTTACTA CTGGAGCTGG 
ATCCGCCAGC CCCCAGGGAA GGGGCTGGAG TGGATTGGGG AAATCAATCA TAGTGGAAGC 
ACCAACTACA ACCCGTCCCT CAAGAGTCGA GTCAC CAT AT CAGTAGACAC GTCCAAGAAC 
CAGTTCTCCC TGAAGCTGAG CTCTGTGACC GCCGCGGACA CGGCTGTGTA TTACTGTGCA 
GAGAGTGAGG GATGGGGATG GGACTACTTT GACTACTGGG GCCAGGGAAC CCTGGTC 
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TTTTCTGGCC TGACAACCAG GGTGGCGCAG GATGCTCAGT GCAGAGAGGA 50 
AGAAGCAGGT GGTCTCTGCA GCTGGAAGCT CAGCTCCCAC CCAGCTGCTT 100 



TGCATGTCCC TCCCAGCTGC CCTACCTTCC AGAGCCGATA TCAATGCCTG 150 



TGTCAGAGCC CTGGGGAGGA ACTGCTCAGT TAGGACCCAG AGGGAACCAT 200 

Me 

GGAAGCCCCA GCTCAGCTTC TCTTCCTCCT GCTACTCTGG CTCCCAGgtg 250 
tGluAlaPro AlaGlnLeuL euPheLeuLe uLeuLeuTrp LeuPro 
agggggaacc atgaggtggt tttgcacatt agtgaaaact cttgccacct 300 

ctgctcagca agaaatataa ttaaaattca aagtatatca acaattttgg 350 

ctctactcaa agacagttgg tttgatcttg attacatgag tgcatttctg 400 

ttttatttcc aatttcagAT ACCACCGGAG AAATTGTGTT GACACAGTCT 450 

Asp ThrThrGlyG luIleValLe uThrGlnSer 
CCAGCCACCC TGTCTTTGTC TCCAGGGGAA AGAGCCACCC TCTCCTGCAG 500 
ProAlaThrL euSerLeuSe rProGlyGlu ArgAlaThrL euSerCysAr 
GGCCAGTCAG AGTGTTAGCA GCTACTTAGC CTGGTACCAA CAGAAACCTG 550 
gAlaSerGln SerValSerS erTyrLeuAl aTrpTyrGln GlnLysProG 
GCCAGGCTCC CAGGCTCCTC ATCTATGATG CATCCAACAG GGCCACTGGC 600 
lyGlnAlaPr oArgLeuLeu IleTyrAspA laSerAsnAr gAlaThrGly 
ATCCCAGCCA GGTTCAGTGG CAGTGGGTCT GGGACAGACT TCACTCTCAC 650 
IleProAlaA rgPheSerGl ySerGlySer GlyThrAspP heThrLeuTh 
CATCAGCAGC CTAGAGCCTG AAGATTTTGC AGTTTATTAC TGTCAGCAGC 700 
rlleSerSer LeuGlu ProG luA spPheAl aVal TyrTyr CysG lnGlnA 



GTAGCAACTG GCCTC CjCACA GTG|ATTCCAC ATGA/jACAAA AACC CCAACA 750 
rgSerAsnTr pPro 

AGACCATCAG TGTTTACTAG ATT ATT AT AC CAGCTGCTTC CTTTACAGAC 800 

AGCTAGTGGG GT 812 
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AGGGCGGCGC AGATGCTCAG TGCAGAGAGA AGAAACAGGT GGTCTCTGCA 50 
GCTGGAAGCT CAGCTCCCAC CCCAGCTGCT TTGCATGTCC CTCCCAGCTG 100 



CCCTACCTTC CAGAGCCCAT ATCAATGCCT GGGTCAGAGC TCTGGGGAGG 150 



AACTGCTCAG TTAGGACCCA GACGGAACCA TGGAAGCCCC AGCGCAGCTT 200 

M etGluAlaPr oAlaGlnLeu 

CTCTTCCTCC TGCTACTCTG GCTCACAGgt gaggggaata tgaggtgtct 250 
LeuPheLeuL euLeuLeuTr pLeuThr 

ttgcacatca gtgaaaactc ctgccacctc tgctcagcaa gaaotataat 300 

taaaattcaa aatagatcaa caattttggc tctactcaaa gacagtgggt 350 

ttgattttga ttacatgagt gcatttctgt tttatttcca atttcagATA 400 

AspT 

CCACCGGAGA AATTGTGTTG ACACAGTCTC CAGCCACCCT GTCTTTGTCT 450 
hrThrGlyGl uIleValLeu ThrGlnSerP roAlaThrLe uSerLeuSer 
CCAGGGGAAA GAGCCACCCT CTCCTGCAGG GCCAGTCAGG GTGTTAGCAG 500 
ProGlyGluA rgAlaThrLe uSerCysArg AlaSerGlnG lyValSerSe 
CTACTTAGCC TGGTACCAGC AGAAACCTGG CCAGGCTCCC AGGCTCCTCA 550 
rTyrLeuAla TrpTyrGlnG InLysProGl yGlnAlaPro ArgLeuLeuI 
TCTATGATGC ATCCAACAGG GCCACTGGCA TCCCAGCCAG GTTCAGTGGC 600 
leTyrAspAl aSerAsnArg AlaThrGlyl XeProAlaAr gPheSerGly 
AGTGGGCCTG GGACAGACTT CACTCTCACC ATCAGCAGCC TAGAGCCTGA 650 
SerGlyProG XyThrAspPh eThrLeuThr IleSerSerL euGluProGl 



AGATTTTGCA GTTTATTACT GTCAGCAGCG TAGCAACTGG CATCCjCACAGj 700 
uAspPheAla ValTyrTyrC ysGlnGlnAr gSerAsnTrp His 



TGATTCCACA TGAAjACAAAA ACCj CCAACAA GACCATCAGT GTTTACTAGA 750 

TTATTATACC AGCTGCTTCC TTTACAGACA GCTAGTGGGG TGGCCACTCA 800 

GTGTTAGCAT CTCAGCTCTA TTTGGCCATT TTGGAGTTCA AGTTGTCAAG 850 

TCCAAAATTA CTTATGTTAG TCCATTGCAT CATACCATTT CAGTGTGGCT 900 
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CCGCCCCAGC TGCTTTGCAT GTCCCTCCCA GCCGCCCTGC AGTCCAGAGC 



50 



CCATATCAATI GCCTGGGTCA GAGCTCTGGA GAAGAGCTGC TCAGTTAGGA 100 



ACCCCAGAGG GAACCATGGA AACCCCAGCG CAGCTTCTCT TCCTCCTGCT 

MetGl uThrProAla GlnLeuLeuP heLeuLeuLe 
ACTCTGGCTC CCAGgtgagg ggaacatggg atggttttgc atgtcagtga 
uLeuTrpLeu Pro 

aaaccctctc aagtcctgtt acctggcaac tctgctcagt caatacaata 
attaaagctc aatataaagc aataattctg gctcttctgg gaagacaatg 
ggtttgattt agattacatg ggtgactttt ctgttttatt tccaatctca 



gATACCACCG 
AspThrThrG 
GTCTCCAGGG 
uSerProGly 
GCAGCAGCTA 
erSerSerTy 
CTCCTCATCT 
LeuLeuIleT 
CAGTGGCAGT 
eSerGlySer 
AGCCTGAAGA 
luProGluAs 



CCCACAGTGA 



GAGAAATTGT 
lyGluIleVa 
GAAAGAGCCA 
GluArgAlaT 
CTTAGCCTGG 
rLeuAlaTrp 
ATGGTGCATC 
yrGlyAlaSe 
GGGTCTGGGA 
GlySerGlyT 
TTTTGCAGTG 
pPheAlaVal 
TTCAGCTTGA 



GTTGACGCAG 
ILeuThrGln 
CCCTCTCCTG 
hrLeuSerCy 
TACCAGCAGA 
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CAGCAGGGCC 
rSerArgAla 
CAGACTTCAC 
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TATTACTGTC 
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AACAAAAACC 



TCTCCAGGCA 
SerProGlyT 
CAGGGCCAGT 
sArgAlaSer 
AACCTGGCCA 
ysProGlyGl 
ACTGGCATCC 
ThrGlylleP 
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rLeuThrlle 
AGCAGTATGG 
InGlnTyrGl 
TCTGCAAGAC 



CCCTGTCTTT 
hrLeuSerLe 
CAGAGTGTTA 
GlnSerValS 
GGCTCCCAGG 
nAlaProArg 
CAGACAGGTT 
roAspArgPh 
AGCAGACTGG 
SerArgLeuG 
TAGCTCACCT 
ySerSerPro 
CTTCATTGTT 



TACTAGATTA TACCAGCTGC TTCCTTTACA GATAGCTGCT GCAATGACAA 
CTCAATTTAG CATCTCTCTC TGCTTGGGCA TTTTGGGGAT CTTAAAAAAG 
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AAGATGGACA GCCAGGTTTA AGCACAGTTT CACAGTAATG GCCACTGGAT 
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AAACACATTC TCTGCAGACA AATTTGAGCT ACCTTGATCT TACCTGGACA 50 

GGTGGGGACA CTGAGCTGGT GCTGAGTTAC TCAGATGCGC CAGCTCTGCA 100 

GCTGTGCCCA GCCTGCCCCA TCCCCTGCTC ATTTGCATGT TCCCAGAGCA 150 



CAACCTCCTG CCCTGAAGCC lTTATTAA"fl AG GCTGGTCAGA CTTTGTGCAG 200 



GAATCAGACC CAGTCAGGAC ACAGCATGGA CATGAGGGTC CTCGCTCAGC 250 

MetAs pMetArgVal LeuAlaGlnL 
TCCTGGGGCT CCTGCTGCTC TGTTTCCCAG gtaaggatgg agaacactag 300 
euLeuGlyLe uLeuLeuLeu CysPhePro 

cagtttactc agcccagggt gctcagtact gctttactat tcagggaaat 350 



tctcttacaa catgattaat tgtgtggaca tttgttttta tgtttccaat 



400 



ctcagGTGCC AGATGTGACA TCCAGATGAC CCAGTCTCCA TCCTCACTGT 450 

GlyAla ArgCysAspI leGlnMetTh rGlnSerPro SerSerLeuS 
CTGCATCTGT AGGAGACAGA GTCACCATCA CTTGTCGGGC GAGTCAGGGT 500 
erAlaSerVa IGlyAspArg ValThrlleT hrCysArgAl aSerGlnGly 
ATTAGCAGCT GGTTAGCCTG GTATCAGCAG AAACCAGAGA AAGCCCCTAA 550 
IleSerSerT rpLeuAlaTr pTyrGlnGln LysProGluL ysAlaProLy 
GTCCCTGATC TATGCTGCAT CCAGTTTGCA AAGTGGGGTC CCATCAAGGT 600 
sSerLeuIle TyrAlaAlaS erSerLeuGl nSerGlyVal ProSerArgP 
TCAGCGGCAG TGGATCTGGG ACAGATTTCA CTCTCACCAT CAGCAGCCTG 650 
heSerGlySe rGlySerGly ThrAspPheT hrLeuThrll eSerSerLeu 
CAGCCTGAAG ATTTTGCAAC TTATTACTGC CAACAGTATA ATAGTTACCC 700 
Gln ProGluA spPheAlaTh rTyrTyrCys GlnGlnTyrA snSerTyrPr 
ACCtACAGTG] TTACACACCC A A'ACATAAACTfc CCAGGGAA GCAGATGTGT 750 



o 

GAGGCTGGGC TGCCCCAGCT GCTTCTCCTG ATGCCTCCAT CAGCTGAGAG 800 
TGTTCCTCAG ATGCAGCCAC ACTCTGATGG TGTTGGTAGA TGGGGAC 847 
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B cell heavy chain expression 
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B cell k light chain expression 
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B cell X light chain expression 
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Ig serum levels in 001 1 mice 
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Anti-human CD4 titers in 0011 mouse 
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Generation of a single large deletion: 
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1 20 kb 
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-4-6 kb" 



-4-6 kb 



PCK-nco 



-Y///1 Vector 1 
HSV-tk 



20-30 kb -20-30 kb 
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FIGURE 63 
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FIGURE 64A 
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human PBL stained with transgenic- 
mouse-derived fully human MAbs: 
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7494-2C1 1-8: ng/ml and Cells/mi vs. Time 
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pGP2b sequence: 



AATTAGCggccgctgtcgacaagcttcgaattcagtatcgatgtggtacctggatcctcgagtgcGGCCGCAGTATGCAA 

AAAAAAGCCCGCTCATTAGGCGGGCTCTTGGCAGAACATATCC^ 

TCTCGGGCAGCGTTGGGTCCTGGCCACGGGTGCGCATGATC 

TGCCTTACTGGTTAGCAGAATGAATCACCGATACGCGAGCG 

AGCAACAACATGAATGGTCTTCGGTTTCCGTC 

TCCGGATCTGCATCGCAGGATGCTGCTGGCTACCCTGTG^ 

TGAGTGATTTTTCTCTGGTCCCGCCGCATC^ 

ATCATCAGTAACCCGTATCGTGAGCATCCTCTCTCGTTTC 

ACGGAGGCATCAAGTGACCAAACAGGAAAAAACCGCCCTTAAC^ 

TCGAGAAACTCAACGAGCTGGACGCGGATGAACAGGCAGACATCTC 

CGCAGCTGCCTCGCGCGTTTCGGTGATGACGGTGAAAACCTCTC 

GTAAGCGGATGCCGGGAGCAGACAAGCCCGTCAGGGCGCGT^ 

AGTCACGTAGCGATAGCGGAGTGTATACTGGCTTAACTATGCGGCATC 

GGTGTGAAATACCGCACAGATGCGTAAGGAGA^^TACCGCATCAGGCGCTC 

CGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGC^ 

CGCAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAG^ 

GGCTCCGCCCCCCTGACGAGCATCACAAAAATCGA 

CAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTC 

CCCTTCGGGAAGCGTGGCGCTITCTCATAGCTCA^ 

GCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTA 

CACGACTTATCGCCACTGGCAGCAGCCAggc gcgc c 1 1 ggc c t aagaggc c aCTGGTAACAGGATTAGCAGAGCGAGGTA 

TGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGG^ 

TGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTG 

TTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATC 

TCAGTGGAACGAAAACTCACGTTAAGGGATTTT^ 

AAAAATCAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTO 

CCTATCTCAGCGATCTGTCTATTTCGTTCATCCA 

CTTACCATCTGGCCCCAGTGCTGCAATGATAGCGCGAGACCCAC 

CAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAAC 

AGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTC 

TGGTATGGCTTC ATTCAGCTCCGGTTCC CAACGATC 

GCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGC^ 

TCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTC 

GCGGCGACCGAGTTGCTCTTGCCCGGCGTCAACACGGGATAATACCGCGCCACATAGC 

TTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATC 

CCCAACTGATCTTCAGCATCTTTTACTTTCACCAGCGT^ 

GGG AATAAGGGCGACACGGAAATCTTGAATACTCAT^^ 

GTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAAT 

CCACCTGACGTCTAAGAAACCATTATTATCATGAC 

AG 



FIGURE 11 A 
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15 Sai I 
21 HinD IK 
25 BstBI 
28 EcoRI 
37 C!a I 
45 Asp718 
45 Kpn i 
52 BamHi 
57 PaeR7I 
57 Xho I 
102 Ban li 
Aatll 3047 181 Msc i 




Sfi I 1639 
Sty I 1635 
BssH II 1630 
Xcm i 1626 
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Expression of human u and y1 in HCo7 GO mice 
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RT PCR detection of human 71 and 
73 transcripts in HCo7 transgenic 
mouse spleen RNA 
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2550 11959 11952 
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HCo7 human IgG response in vitro 
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referred to above. I acknowledge the duty to disclose information which is material to the examination of this application in 
accordance with Title 37, Code of Federal Regulations, Section 1.56. I claim foreign priority benefits under Title 35, United States 
Code, Section 119 of any foreign applications(s) for patent or inventor's certificate listed below and have also identified below any 
foreign application for patent or inventor's certificate having a filing date before that of the application on which priority is claimed. 

Prior Foreign Application^) 



Country 


Application No. 


Date of Filing 


Priority Claimed 
Under 35 USC 119 








Yes_ No_ 








Yes_ No_ 



^ I hereby claim the benefit under Title 35, United States Code §1 19(e) of any United States provisional applications) listed below: 



Application No. 


Filing Date 











I claim the benefit under Title 35, United States Code, Section 120 of any United States application(s) listed below and, insofar as 
the subject matter of each of the claims of this application is not disclosed in the prior United States application in the manner 
provided by the first paragraph of Title 35, United States Code, Section 1 12, 1 acknowledge the duty to disclose material information 
as defined in Title 37, Code of Federal Regulations, Section 1.56 which occurred between the filing date of the prior application and 
the national or PCT international filing date of this application: 



Application No. 


Date of Filing 


Status 


08/544,404 


10-10-95 


_ Patented X Pending Abandoned 


08/352,322 


12-07-94 


_ Patented _X_ Pending _ Abandoned 


08/209,741 


03-09-94 


_ Patented _X_ Pending _ Abandoned 


08/165,699 


12-10-93 


_ Patented _X_ Pending _ Abandoned 


08/161,739 


12-03-93 


_ Patented X Pending Abandoned 


08/155,301 


11-18-93 


Patented _ Pending Jt_ Abandoned 
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08/096,762 


07-22-93 


_ Patented _X_ Pending _ Abandoned 


08/053,131 


04-26-93 


_ Patented _X_ Pending _ Abandoned 


07/990,860 


12-16-92 


_ Patented _X_ Pending _ Abandoned 


07/904,068 


06-23-92 


_ Patented _X_ Pending _ Abandoned 


07/853,408 


03-18-92 


_ Patented _X_ Pending _ Abandoned 


07/810,279 


12-17-91 


_ Patented _X_ Pending _ Abandoned 


07/575,962 


08-31-90 


_ Patented Pending X Abandoned 


07/574,748 


08-29-90 


_ Patented Pending X Abandoned 



POWER OF ATTORNEY: As a named inventor, I hereby appoint the following attorney(s) and/or agent(s) to prosecute this 
application and transact all business in the Patent and Trademark Office connected therewith. 



Randolph T. Apple, Reg. No. 36,429 
William M. Smith, Reg. No. 30,233 
Joe Liebeschuetz, Reg. No. 37,505 



Send Correspondence to: 


Direct Telephone Calls to: 


William M. Smith 


(Name, Reg. No., Telephone No.) 


TOWNSEND and TOWNSEND and CREW LLP 




Two Embarcadero Center, 8th Floor 


Name: Randolph T. Apple 


San Francisco, CA 94111-3834 


Reg. No.: 36,429 




Telephone: 415-326-2400 



Full Name 
of Inventor 1 



Last Name 
Lonsberg 



First Name 
Nils 



Middle Name or Initial 



Residence & 
Citizenship 



City 

Redwood City 



State/Foreign Country 
California 



Country of Citizenship 
U.S.A. 



Post Office 
Address 



Post Office Address 
930 EdgecIifFWay 



City 

Redwood City 



State/Country 
California 



Zip Code 
94061 



Full Name 
of Inventor 2 



Last Name 
Kay , 



Robert 



le Name or Initial 



Residence & 



City 



Citizenship — ~*5anJ5Vancisco 



State/Foreign Country 
California 



Country of Citizenship 



Post Office 
Address 



Post Office Address 
2127 Broadway #5 



City 

San Francisco 



State/Country 
California 



Zip Code 
94115 
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I further declare that all statements made herein of my own knowledge are true and that all statements made on information and belief 
are believed to be true; and further that these statements were made with the knowledge that willful false statements and the like so 
made are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United States Code, and that such willful 
false statements may jeopardize the* validity of the application or any patent issuing thereon. 



Signature of Inventor 1 
Nils Lonberg 


Signatoire of Inventor 2 
Robert M. Kay / 


Signature of Inventor 3 


Date 


Date /^//// 9£ 


Date 



DP.MRG 8/96 
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VERIFIED STATEMENT (DECLARATION) CLAIMING SMALL ENTITY 
(37 CFR 1.9(f) ft 1.27(c)) - SMALL BUSINESS CONCERN 




No. 014643-00M20US 



Applicant or Paten we; Nib Lonbcnr uri Rob^^t. Kav 



Application or Patent No.: 08/728.463 



Hied or Iwued: ID October. 1996 < 

Tide; TftANSGEWTC-WOtMltlMAN ANIMALS FOR PRODUCING HETEROLOGOUS AttTlBQCflBr :*g 




jthe owner of the small business concern identified below; ^ » $ 

III official of ihe sow)] business concern empowered lo act on behalf of the ooacc ^wn ti fi cd below: 

Name of Small Business Concern: Genpharm International. Tnc. - .^v-^sHh^^h--- 

Address of Small Business Concent: 297 N. Bernardo Avenue " 4. 



f hereby declare that I am: 



Mouni&in Vjew, California $4043 



I hereby declare that the above-identified small business concern qualifies as 1 small business concern as defined la 13 CFR 121 .12, and reproduced 
in 37 CFR 1.9(d), for purposes of paying reduced fees to the United States Patent and Trademark Office, » jtKsf t&^Mflfe of employees of Ok 
concern, including those of its affiliates, does not exceed 500 persons. For purposes of this staterncrit; (0 

concern is the average over the previous fiscal year of the concern of the persons employed on i fulWmc^|^ basis during each 

of the pay periods of the fiscal year, and (2) concerns are affiliates of each other when either, directly or ind^cctly, one concern controls or has 
the power to control the ©then or a third party or parties controls or has the power to control both. ^ 

I hereby declare that rights under contract or law have been conveyed to and remain with the small business got 
to the invention, entitled TRANSGENIC NON-HUMAN ANIMALS POR PRODUCING HETEROLOGOUS , 



Lonberg and Robert M. Xmv described in: 



-identified above with regard 
by i&ventor(B) N jfr 



a 
□ 



the srjeoification filed herewith. 

Application No. OR/728.463 filed 10 October. 1996. 
: No, . issued 



If the rights held by the above identified small business concern are not exclusive* each individual, 
invention is fisted below* end no rights to the invention are held by any person, other than the inventor^ wiw , 
inventor under 37 CFR 1.9(e) if that person made the invention* or by any concern that would not qualify as a ; * 
1.9(d), or a nonprofit organization Under 37 CFR 1.9(e), • 




mccrnor 



•NOTE: Separate verified statements axe required item each named person, concern or organization 
averring Co their status as small entities. (37 CFR 1.27) r *f t^f*" 



having rights in the 
as an independent 
concern under 37 CFR 



— rights to the invention 



Name 
Address 



0 Individual 



Q Small Business Concern 



0 Nonprofit Organization 



Name 
Address . 



Q Individual 



Q Small Business Concern 



0 Nonprofit Organization 



I acknowledge the duty to file, in this application or patent, notification of any change an szaawxcsidting w^aesii 
prior to paying* or at the time of paying, the earliest of the issue fee or any maintenance fee^due sjfcr^ti$i(^ 
is no longer appropriate. (37 CFR 1.28(b)). *- -^i^PifH- 



: to smell entity status 
t as a small entity 



I hereby declare that all statements made herein of my own knowledge arc true and that all statements made on information and belief are believed 
to be true; and further that chose statements were made with the knowledge that willful false statements and tee lite s*Mnade mro punishable by fine 
or imprisonment, or both, under section 1001 of Title 1 8 of the United States Code, and that socfrwiUful falite^Dtlc un ei flriii ayjcot^cy the validity 
of the application, any potent issuing thereon, or any patent to which this verified statement is directed: ^■^^■*4ff^ , *^ ; 



Jonathan MacOuiav. Ph.P, 



Name of Person Signing: 

Tide of Person if Other than Owner Chief Executive Officer 
Address of Person Signing; 855 t^TifaTw^ Avcnnc. Suite $ 



Palo Alto. C* \ fornk 94301-1103 



Signature 




Date 



Jonathan Mat 
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